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Preface 


The  International  Symposium  on  Dynamical  Systems,  of  which  this 
volume  is  the  proceedings,  was  held  at  Brown  University,  August  12-16, 
1974  and  was  the  formal  occasion  for  dedicating  the  Lefschetz  Center  for 
Dynamical  Systems  to  the  memory  of  Solomon  Lefschetz.  The  central  theme 
of  the  symposium  was  the  manner  in  which  the  theory  of  dynamical 
systems  continues  to  permeate  current  research  in  ordinary  and  functional 
differential  equations,  and  how  this  approach  and  the  techniques  of  ordinary 
differential  equations  have  begun  to  influence  in  a significant  way  research 
on  certain  types  of  partial  differential  equations  and  evolutionary  equations 
in  general.  This  volume  provides  an  exposition  of  recent  advances,  present 
status,  and  prospects  for  future  research  and  applications. 

The  editors  and  the  Lefschetz  Center  for  Dynamical  Systems  wish  to 
thank  the  Air  Force  Office  of  Scientific  Research,  the  Army  Research 
Office  (Durham),  the  National  Science  Foundation,  the  Office  of  Naval 
Research,  and  Brown  University  for  the  generous  support  that  made  this 
symposium  possible.  The  editors  were  responsible  for  the  program,  and  we 
wish  here  to  express  on  behalf  of  all  the  participants  our  appreciation  to 
H.  Thomas  Banks,  Ettore  F.  Infante,  and  Constantine  Dafermos  for  their 
planning  and  organization  of  the  meeting. 
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number  of  highly  interesting  new  fields,  notably  about  the  superb  Italian 
school  of  algebraic  geometry.” 

Mrs.  Lefschetz  was  a perfect  wife  and  companion  for  Lefschetz.  She,  too, 
had  strong  convictions  and  an  independent  mind.  She  remained  much  in  the 
background.  It  was  most  delightful  to  spend  hours  in  conversation  with 
her  and  Lefschetz.  I once  took  my  young  son  with  me  on  a visit  with  the 
Lefschetz’s  at  1 1 Lake  Lane  in  Princeton.  I had  warned  him  to  be  on  his 
best  behavior  and  to  remember  that  the  Lefschetz’s  were  old  and  not  used 
to  children.  We  were  there  for  almost  four  hours,  and  my  son  was  as  quiet 
as  I have  ever  seen  him.  As  soon  as  we  left  and  were  in  the  car  by 
ourselves,  he  said  to  me  indignantly,  “They  are  not  old!”  During  the  time 
that  Lefschetz  was  traveling  by  plane,  he  and  Mrs.  Lefschetz  never  flew 
together,  even  though  they  had  no  dependents.  I once  asked  Lefschetz 
about  this.  He  said,  "She  stays  home  to  clean  the  house.  She’s  afraid  someone 
will  break  in  while  we  are  gone  and  find  the  place  dirty.” 

Lefschetz  left  Nebraska  after  two  years  and  was  an  instructor  at  the 
University  of  Kansas  from  1913  to  1916,  an  associate  professor  from  1916 
to  1919,  and  a professor  from  1923  to  1925.  Lefschetz  in  describing  his 
research  at  Kansas  on  algebraic  curves  on  hyperelliptic  surfaces  writes: 
“This  [research]  launched  me  into  Poincare-type  topology,  the  1919  Bordin 
Prize  of  the  Paris  Academy,  and  in  1924  Princeton!  . . . The  immediate 
effect  of  the  prize  was  the  Kansas  promotion  (January  1920)  to  associate 
professor  plus  a schedule  reduction.”  Until  that  time  he  had  been  teaching 
eighteen  hours  or  more  a week.  He  later  speaks  wistfully  of"  the  mathematical 
calm  of  Nebraska- Kansas  which  I had  so  enjoyed  without  realizing  it.” 

Well,  he  did  go  to  Princeton  in  1924  and,  as  we  know,  eventually  became 
one  of  the  most  original  and  influential  and  well-recognized  mathematicians 
of  his  generation. 

During  World  War  II  and  at  a time  when  he  was  60  years  old  and  had 
already  made  superb  contributions  to  two  fields  of  mathematics,  he  turned 
his  attention  to  the  subject  of  differential  equations.  His  interest  in  and  his 
promotion  of  differential  equations  lasted  throughout  the  remaining  28  years 
of  his  life.  I visited  Lefschetz  a few  months  before  he  died.  He  was  first 
interested  in  everything  that  was  going  on  at  Brown,  both  mathematically 
and  otherwise.  Next,  he  wanted  me  to  take  him  out  to  dinner. 

He  writes:  "Then  came  World  War  II  and  I turned  my  attention  to 
differential  equations.  With  Office  of  Naval  Research  backing  (1946-1955), 
I conducted  a seminar  on  the  subject  from  which  there  emanated  a number 
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of  really  capable  fellows,  also  a book:  Differential  Equations : Geometric 
Theory  (1957).”  The  acme  of  understatement. 

The  year  that  Lefschetz’  interest  in  differential  equations  began  was  the 
year  of  the  death  of  George  David  Birkhoff  (1884-1944).  Lefschetz  and 
Birkhoff  were  contemporaries  (they  were  the  same  age)  but  were  less  than 
friendly.  It  was  only  much  later  (around  1965)  that  Lefschetz  finally 
recognized  that,  after  Poincare  and  Liapunov,  Birkhoff  was  one  of  the 
cofounders  of  the  geometric  theory  of  differential  equations.  This  was  after 
the  work  of  Moser,  and  after  I had  established  a relationship  between 
Birkhoff’s  limit  sets  and  Liapunov  stability  theory.  William  Hodges  points 
out,  in  relationship  to  this,  an  interesting  aspect  of  Lefschetz’  character. 
“He  [Lefschetz]  asserted  (with  much  truth)  ‘he  made  up  his  mind  in  a flash 
and  found  his  reasons  later.’  Naturally,  he  made  mistakes  this  way,  but  once 
he  was  really  convinced  that  he  was  wrong  he  could  be  extremely  generous.” 

But  to  return  to  1944,  the  kindest  word  for  the  state  of  differential 
equations  within  mathematics  in  the  United  States  is,  perhaps,  dormant. 
At  the  1965  International  Symposium  on  Differential  Equations  and 
Dynamical  Systems  in  Puerto  Rico,  of  which  this  symposium  is  the  successor, 
Lefschetz  in  his  talk  (after  claiming  that  as  an  “amateur”  he  may  have 
“a  broader  view  of  the  field  of  differential  equations  than  its  professionals!”) 
said: 

“The  study  of  geometric  differential  equations  as  a chapter  of  differential 
equations  was  really  founded  by  Henri  Poincare  in  his  classical  “Memoirs” 
on  curves  defined  by  a differential  equation  (1881).  The  cofounders  include 
Liapunov,  who  in  his  great  “Memoire”  created  a true  theory  of  dynamic 
stability,  and  G.  D.  Birkhoff,  who  is  responsible  for  so  many  new  concepts 
in  dynamics. 

“Curiously,  until  recently  the  immense  contributions  of  the  founders  were 
practically  ignored  by  the  mathematical  public  in  general  and  especially  by 
applied  mathematicians  [he  is  speaking  of  the  United  States]  . . . The  reason 
is  not  far  to  seek  . . . almost  universal  linearization  prevailed.”  The  major 
interest  among  electrical  engineers  in  the  United  States  during  the  period 
before  World  War  II  was  in  the  development  of  long  distance  telephony 
and  radio  communication.  The  problems  were  linearized,  no  differential 
equations  were  required,  and  the  mathematics  exploited  was  the  theory  of 
functions  of  a complex  variable.  During  this  time  one  might  add  that 
differential  equations  was  one  of  the  poorest  taught  and  most  boring  of  our 
undergraduate  courses  in  mathematics.  Some  students  who  studied  topology 
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not  only  did  not  know  what  a differential  equation  was,  they  did  not  even 
know  the  meaning  of  a derivative. 

Well,  I do  not  need  to  relate  to  this  audience  what  happened  to  the 
field  of  differential  equations  in  this  country  following  Lefschetz.  I will, 
however,  allow  myself  one  remark.  It  is  interesting  (and  quite  useless)  to 
speculate  about  what  might  have  happened  had  Lefschetz  remained  an 
industrial  engineer.  I feel  sure  of  only  one  thing.  Short  of  an  amendment 
to  the  Constitution  he  would  not  have  become  President  of  the  United 
States.  But  having  left  engineering  and  turned  to  mathematics,  he  made  in 
turn  a profound  contribution  to  engineering.  He  was  responsible  for  intro- 
ducing us  (and  the  whole  Western  world)  to  two  of  the  mainstreams  of 
modern  control  theory:  Liapunov’s  theory  of  stability  and  the  mathematical 
theory  of  optimal  control.  Of  the  latter,  he  was,  through  his  students  and 
“a  number  of  really  capable  fellows”  who  came  through  his  Princeton 
project,  instrumental  in  its  founding. 

After  his  retirement  from  Princeton  in  1953,  he  had  also  to  retire  as 
director  of  his  differential  equations  project  (a  university  rule)  and  during 
the  next  five  years  his  project  was  gradually  phased  out.  The  project  which 
ran  from  1946  through  1959  operated  initially  on  an  annual  grant  of  $25,000 
from  ONR,  later  reduced  to  $20,000  a year  and  shared  for  a few  years 
with  the  Air  Research  and  Development  Command. 

After  attempts  by  Lefschetz  to  found  a research  institute  at  other 
universities  failed,  he  received  carte  blanche  from  the  Martin  Company  in 
November  1957  to  form  a mathematics  center  at  their  Research  Institute 
for  Advanced  Study,  Baltimore,  and  was  charged  with  making  “this  Center 
an  outstanding  example  of  its  kind  in  the  world.”  Lefschetz  was  then 
almost  70  years  old.  Within  three  years  Lefschetz  had  done  this,  and  his 
group  had  achieved  an  international  reputation.  Lefschetz  writes  of  this 
as  follows:  “Very  shortly  we  became  known.  A considerable  number  of 
good  differential  equationists  visited  us,  and  some  few  were  invited  for  a year 
or  so.  After  some  six  years  it  was  necessary  to  transfer  elsewhere.  This 
operation  . . . resulted  in  our  becoming  part  of  the  Division  of  Applied 
Mathematics  at  Brown  University  as  the  “Center  for  Dynamical  Systems” 
with  LaSalle  as  director  and  myself  as  (once  weekly)  visiting  professor.  At 
Brown  our  general  relationship  was  excellent.”  As  with  much  of  Lefschetz’ 
mathematics,  one  would  have  to  dig  deeply  to  comprehend  fully  what  lies 
behind  what  he  has  written.  Suffice  it  to  say  that  we  are  still  grateful  to  the 
Martin  Company  (now  Martin-Marrietta)  for  the  opportunity  RIAS  offered 
us  and  for  the  generous  support  while  we  were  there  from  AFOSR,  ARO, 
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NASA,  and  ONR.  Because  of  an  administrative  ruling  (not  a part  of 
Congress’  basic  law  in  establishing  NSF)  we  were  not  eligible  at  RIAS  for 
NSF  support.  For  this  and  other  reasons,  the  support  of  basic  research 
outside  universities  is  difficult  and  unstable  in  this  country,  and  we  had  a 
strong  desire  to  train  young  people.  Lefschetz  was  instrumental  in  our 
selecting  Brown  as  a place  to  establish  what  is  now  his  Center. 

We  are  lineal  descendants  of  Lefschetz’  Princeton  differential  equations 
project,  and  we  hope  that  this  Center  will  be  a lasting  memorial  to  him  and 
that  it  will  remind  everyone  who  works  and  visits  here,  or  knows  of  us, 
that  we  reside  eternally  in  his  debt. 

In  final  tribute,  I quote  from  my  memorial  to  Solomon  Lefschetz  written 
for  the  IEEE  Transactions  on  Automatic  Control: 

It  was  Solomon  Lefschetz  who  made  the  subject  of  differential  equations 
both  respectable  and  lively  [in  this  country]  and  who  through  his  projects 
at  Princeton  and  RIAS  and  his  association  with  the  Center  for  Dynamical 
Systems  at  Brown  made  it  possible  with  his  boundless  enthusiasm, 
inspiration,  and  guidance  for  many  young  people  to  establish  deep  roots 
in  the  subject.  We  will  remember  always  his  love  of  life,  humor,  self- 
discipline,  courage,  wisdom,  vigor,  incessant  curiosity,  towering  intellect, 
and  creative  genius. 

At  Princeton  the  students  in  their  songs  about  faculty  had  the  following 
verse : 

Here’s  to  Lefschetz  (Solomon  L.) 

Who's  as  argumentative  as  hell, 

When  he’s  at  last  beneath  the  sod 
Then  he’ll  start  to  heckle  God. 

I feel  sure  that  Lefschetz  and  God  are  getting  along  very  well. 


Joseph  P.  LaSalle,  August  1974 
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Chapter  1 : QUALITATIVE  THEORY 


Some  Aspects  of  the  Qualitative  Theory 
of  Differential  Equations 

CHARLES  CONLEY 

Department  ot  Mathematics 

University  of  Wisconsin,  Madison,  Wisconsin 


In  these  remarks  some  problems  of  a qualitative  nature  are  discussed. 
A salient  feature  of  the  problems  is  that  they  do  not  require  any  dilTIcult 
computations;  a definition  of  qualitative  will  not  be  given.  All  the  examples 
here  concern  the  existence  of  orbits  that  are  situated  in  some  special  way ; 
in  a general  sense  they  are  boundary-value  problems. 

In  Fig.  1 a “tube”  with  square  cross  section  is  pictured.  There  is  supposed 
to  be  a differential  equation  whose  solutions  behave  in  the  following  way: 
on  the  top,  bottom,  and  left  end  they  cross  into  the  tube;  on  the  sides 
and  the  right  end  they  cross  out  of  the  tube.  In  particular,  the  orbit  through 
any  boundary  point  leaves  the  tube  in  one  or  the  other  (or  both)  time 
directions  in  the  terms  of  T.  Wazewski  each  boundary  point  is  a point  of 
strict  exit  or  entrance.  Consequently,  the  mapping  that  assigns  to  each  point 
of  the  tube  its  first  point  of  exit  is  continuous,  as  is  the  analogous  mapping 
for  the  backward  flow;  the  mapping  from  entering  to  exiting  points  is  a 
homeomorphism  wherever  it  is  defined. 

In  the  present  case  it  is  assumed  that  there  is  no  invariant  set  in  the 
tube  thus  every  point  is  mapped  both  to  the  exit  set  and  the  entrance  set 
under  the  above  mappings. 

An  orbit  segment  will  be  said  to  cross  the  tube  if  it  runs  from  the  left 
end  to  the  right  end  without  leaving  the  tube.  The  first  question  concerns 
the  existence  of  these.  The  answer  is  that  the  set  of  such  orbits  meets  the 
left  end  in  a set  that  connects  the  top  and  bottom  of  the  tube  and  likewise 
meets  th  right  end  in  a set  that  connects  the  sides  of  the  tube. 

The  main  part  of  the  argument  is  this:  consider  any  arc  in  the  left  end 
that  connects  the  sides;  the  ends  go  out  (immediately)  from  different  sides. 
Since  the  arc  goes  out  through  an  arc  in  the  exit  set,  some  points  of  it  must 
leave  through  the  right  end. 
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Fig.  1. 


Here  is  an  "'application”:  consider  the  equations 

X = X +f(x,  y,  r),  y = -y  + g(x,  y,  r),  t = 1, 

with  the  hypothesis  that  (f2  + g2)/(x2  + y2)  -*  0 as  .x2  + y2 -♦  + oo 
uniformly  in  t e ( — oo,  oo).  The  question  concerns  the  existence  of  bounded 
solutions. 

The  hypothesis  implies  there  is  an  infinitely  long  tube  of  the  form 
{(*.  y*  OIMI.  y|  < Cj,  where  C is  a large  enough  constant.  An  argument 
similar  to  that  above  implies  that  the  set  of  orbits  that  stay  in  the  right 
half  of  this  tube  (from  t = 0)  meets  the  t = 0 section  in  a set  connecting 
the  top  and  bottom.  Likewise,  this  section  contains  a set  connecting  the 
sides  such  that  the  left  half  orbit  through  points  in  this  set  stays  in  the 
left  half  tube.  Since  these  two  connecting  sets  must  intersect,  bounded 
orbits  exist. 

A sequence  of  tubes  will  be  said  to  be  correctly  connected  if  the  right  end 
of  the  kth  overlaps  the  left  end  of  the  (k  + l)st  as  shown  in  Fig.  2.  If  a 
finite  sequence  of  tubes  is  correctly  connected  there  are  orbit  segments 
that  run  from  the  left  end  of  the  first  to  the  right  end  of  the  last  passing 
through  the  tubes  in  sequence  and  contained  in  their  union.  The  proof  is 
based  on  the  observation  that  any  arc  in  the  left  end  of  the  kth  tube  that 
connects  the  sides  contains  a subarc  that  is  carried  to  the  right  end  so  that 
its  image  connects  the  sides  of  the  (k  + 1 )st  tube. 

Given  a bi-infinite  sequence  of  correctly  connected  tubes  such  that  the 
time  to  cross  any  one  is  more  than  (say)  1,  it  follows  that  there  are  orbits 
(at  least  one)  that  are  contained  in  the  union  of  the  tubes  and  pass  through 
them  in  sequence.  If  the  sequence  of  tubes  is  periodic,  then  there  must  be 
a periodic  orbit  in  the  union  of  the  tubes.  The  argument  in  this  case  needs 
an  additional  ingredient. 

These  remarks  apply  to  compact  hyperbolic  invariant  sets  of  smooth 
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Fig.  2. 


flows.  The  application  requires  the  following  definition:  a finite  sequence 
of  orbit  segments  is  called  an  e,  f-chain  from  x to  y if  ( 1 ) the  segments 
are  each  of  time  length  at  least  f,  (2)  x is  within  e/2  of  the  initial  point 
of  the  first  arc,  (3)  the  terminal  point  of  the  kth  arc  is  within  e of  the 
initial  point  of  the  (k  + l)st  arc,  and  (4)  the  terminal  point  of  the  last  arc 
is  within  e/2  of  y.  The  distances  are  chosen  so  that  e,  f-chains  from  x to 
y and  y to  z can  be  put  together  to  make  one  from  x to  z. 

If  H is  a compact  hyperbolic  set.  then  for  sufficiently  small  e there 
corresponds  to  any  e,  r-chain  (in  H ) from  x to  y a sequence  of  tubes,  one 
for  each  segment  in  the  chain,  which  are  correctly  connected  and  which 
approximately  contain  their  corresponding  orbit  segments.  Furthermore,  the 
tubes  have  roughly  the  same  cross-sectional  diameter  and  this  can  be  chosen 
to  go  to  zero  with  e independently  of  the  e,  f-chain. 

This  consequence  is  fairly  evident  from  the  definition  of  hyperbolic: 
namely,  that  the  tangent  flow  restricted  to  the  tangent  bundle  over  H is 
the  sum  of  three  invariant  subbundles,  the  first  spanned  by  contracting 
directions  (vertical  in  Fig.  1),  the  second  by  the  flow  direction  (end  to  end), 
and  the  last  by  expanding  directions  (side  to  side). 

Suppose  now  that  tor  some  x e H there  is  an  £,  f-chain  from  x back  to  x 
all  of  whose  segments  lie  in  H.  Then  (if  r,  is  small  enough)  there  is  a periodic 
orbit  passing  near  x (it  may  not  be  in  H though).  If  an  opseudo-orbit 
means  a bi-infinite  e,  f-chain,  then:  uniformly  close  to  an  e pseudo-orbit 
in  H there  lies  a real  orbit.  For  flows  a more  careful  statement  concerning 
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time  parametri/ation  could  be  given  : for  diffeomorphisms  the  problem  does 
not  arise. 

Actually,  in  this  setting,  bi-infinite  sequences  of  tubes  determine  unique 
orbits  (compare  to  the  classical  theorem  for  the  first  set  of  equations  when 
the  parlials  of  / and  are  small);  thus  one  can  guess  at  the  structural 
stability  statement  for  hyperbolic  invariant  sets.  The  sequence  of  tubes  is 
"stable”  under  C1  perturbations  and  defines  the  orbit  correspondence  at 
least. 

Here  is  a relevant  problem:  suppose  the  nonwandering  set  (of  a smooth 
flow  on  a compact  manifold)  is  a hyperbolic  invariant  set.  Does  it  follow 
that  it  is  the  closure  of  the  periodic  orbits'.’  For  "proof":  first,  every  point 
x of  the  nonwandering  set  is  chain-recurrent,  meaning  that  for  all  t > 0. 
there  is  an  r-chain  from  x back  to  itself  (only  one  segment  is  required). 
As  above,  there  is  a periodic  orbit  near  x.  This  "proof"  is  wrong  because 
the  segments  in  the  chain  may  not  lie  in  the  nonwandering  set. 

Avoiding  this  problem,  define  the  chain-recurrent  set  of  a flow  to  be  the 
set  of  chain-recurrent  points  it  is  a closed  invariant  set  containing  the 
nonwandering  set.  In  contrast  to  the  case  of  the  nonwandering  set.  however, 
if  one  restricts  the  flow  to  its  chain-recurrent  set  R.  then  the  chain-recurrent 
set  of  this  new  flow  is  R itself.  This  means  that  for  x e R.  the  chains  from 
x to  x can  be  chosen  to  lie  in  R.  Thus,  if  the  chain-recurrent  set  is 
hyperbolic,  the  periodic  orbits  are  dense. 

An  axiom  A flow  is  one  such  that  the  nonwandering  set  is  hyperbolic 
and  also  the  closure  of  the  periodic  points.  A reasonable  conjecture  is  as 
follows:  A flow  satisfies  Axiom  A plus  the  strong  transversality  condition 
if  and  only  if  the  chain-recurrent  set  is  hyperbolic. 

One  other  remark  about  //:  if  xe  H is  chain-recurrent  but  not  periodic, 
then  Smale’s  shift  automorphism  is  embedded  near  x;  one  can  use  the  tube 
construction  together  with  two  distinct  periodic  orbits  near  x to  construct 
a mapping  with  all  the  essential  features  of  Smale’s  horseshoe  mapping. 

A problem  in  which  something  close  to  these  tubes  plays  a role  is  the 
following:  Does  the  equation  below  admit  a nontrivial  standing  wave 
solution? 

P,  = Pxx  + s(x)p(l  - P)- 

This  is  really  an  ordinary  differential  equations  problem;  standing  wave 
means  p,  = 0.  Here  x runs  from  - x to  + x.  Given  that  s(x)  approaches 
a negative  limit  at  - x and  a positive  limit  at  + x.  one  proves  that  there 
is  a standing  wave  that  goes  to  0 at  - x and  to  1 at  4-  x.  and  otherwise 
lies  between  0 and  1. 
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Writing  the  equation  for  standing  waves  as  a system,  one  has 
p = q,  q = -s(x)p(\  - p). 

Choose  x ! such  that  s(x)  is  close  to  6 > 0 for  x > x,.  Now  consider  the 
set  ("tube")  {/?,  q.  x|x  > x, ; 0 < p < 1 : q > OJ  (cf.  Fig.  3).  Since  d(x)  > 0 
for  x > x,.  one  finds  that  the  face  \p  = 0;  q > OJ  consists  of  strict  entrance 
points,  while  the  faces  {0  < p < 1 : q = OJ  and  {p  = 1 : q > OJ  consist  of  strict 
exit  points.  With  the  same  type  of  argument  as  applied  before  one  sees 
that  any  arc  from  the  horizontal  exit  face  to  the  vertical  one  must  contain 
a point  that  stays  in  the  tube  as  x increases.  A little  analysis  shows  that 
orbits  through  such  points  satisfy  the  right-hand  boundary  condition  (one 
uses  the  fact  that  ,s(x)  is  not  integrable  near  + x ).  Finally,  an  estimate 
on  the  behavior  of  orbits  with  an  initially  large  q value  shows  that  the  set 
of  orbits  satisfying  the  right-hand  boundary  condition  meets  the  left  end  of 
the  tube  in  a set  that  connects  the  entering  face  to  the  point  p = 1.  q = 0 
(cf.  big.  3). 


q 

i 


\ 

\ 


Fig  3 


Reasoning  in  a similar  way  with  x0  such  that  v(x)  is  close  to  -A  <0 
for  x < x0.  one  obtains  an  analogous  picture  (Fig.  4)  of  the  set  of  points  w ith 
x = x0  that  are  on  orbits  satisfying  the  left-hand  boundary  condition. 

To  span  the  gap  from  x0  to  x,  one  uses  the  slab  JO  < p < 1.  x0  < x < xqj. 
which  also  has  the  essential  property  that  exit  and  entrance  points  are 
strict.  One  finds  the  arc  of  Fig.  4 is  carried  so  that  it  must  intersect  that 
of  Fig.  3.  In  this  example  not  every  boundary  point  of  the  tubes  used  is  an 
entrance  or  exit  point,  but  those  that  are  are  strict,  which  is  the  essential 
thing. 

The  aim  now  is  to  lead  up  to  another  kind  of  "boundary-value" 
problem:  however,  the  route  will  be  somewhat  roundabout  in  order  that 
some  vaguely  relevant  facts  can  be  brought  into  brief  focus. 
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Fig  4 
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An  attractor  means  here  a set  that  is  the  w-limit  set  of  some  compact 
neighborhood  of  itself:  this  is  more  restrictive  than  saying  it  contains  the 
limit  set  of  each  point  in  some  neighborhood  of  itself. 

Define  a '‘block"  to  be  a compact  set  each  one  of  whose  boundary  points 
is  a point  of  strict  exit  or  strict  entrance.  The  definition  of  an  attractor 
is  equivalent  to  saying  it  is  the  maximal  (inclusion)  invariant  set  in  some 
block  all  of  whose  boundary  points  are  entering  points.  For  example,  the 
empty  set  and  the  whole  space  (if  compact)  and  any  closed  open  subset 
of  the  space  are  (trivial)  attractors— they  serve  as  their  own  blocks. 

A filtration  (of  a llow  on  a compact  space)  means  a finite  increasing 
sequence  of  attractors  the  last  of  which  is  the  whole  space.  Thinking  of  the 
use  of  filtration  in  topology,  perhaps  one  should  say  instead  that  it  is  an 
increasing  sequence  of  blocks  for  these  attractors,  but  the  attractors  them- 
selves are  more  relevant  here. 

Each  attractor  has  a dual  repeller  (i.e..  attractor  for  the  reverse  flow). 
Namely,  it  is  the  maximal  invariant  set  in  the  complement  of  any  block 
for  the  attractor. 

The  Morse  sets  of  a filtration  & = {(f)  = A0,  A , A„  = X\  are  the  sets 

Mk  = Ak  n Ak_  , (k  = 1 n),  where  /!*_ , is  the  repeller  dual  to  4k_  ,. 

Here  is  the  theorem:  Let  . //(.?)  be  the  union  of  the  Morse  sets  of 
Then  the  chain-recurrent  set  of  a flow  on  X is  the  intersection  over  all 
filtrations  '/  of  the  corresponding  sets  .//{.?).  In  particular,  every  point  of 
X is  chain-recurrent  if  and  only  if  the  flow  has  no  nontrivial  attractors. 
This  implies  the  existence  of  a "Liapounoff  function”  that  is  constant  on 
components  at  the  chain-recurrent  set  and  otherwise  is  strictly  decreasing 
on  orbits.  We  conjecture  as  follows:  With  the  C 0 topology  on  the  space 
of  flows,  the  set  of  points  of  semicontinuity  of  the  nonwandering  set  function 
is  precisely  the  set  of  points  where  the  nonwandering  set  and  the  chain- 
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recurrent  set  coincide  (the  chain-recurrent  set  function  is  upper  semi- 
continuous  everywhere). 

The  homotopy  index  of  the  Morse  set  Mk  is  the  homotopy  type  [Bk  Bk-  ,] 
meaning  that  obtained  from  a block  Bk  for  Ak  on  collapsing  to  a point  of 
a (smaller)  block  Bk_l  for  /!*_,.  Fortunately,  the  index  does  not  depend 
on  the  block  chosen. 

One  can  pursue  these  thoughts  further  and  prove  a Morse  Smale 
theorem  for  the  Morse  sets  of  a filtration.  The  statement  and  proof  make 
use  of  an  exact  sequence  relating  the  cohomologies  of  the  index,  the  Morse 
set,  and  (sections  of)  the  stable  and  unstable  “manifolds”  of  the  Morse  set. 

The  Morse  sets  Mk  and  Mk+X  are  both  contained  in  the  kth  Morse  set 
of  the  smaller  filtration  { A0 , Ak,  Ak  + 2,  •••i  call  this  set  M.  If 
M = MluM|lt|  then  the  index  of  M is  the  sum  of  those  of  Mk  and  Mk + X, 
where  the  sum  of  pointed  spaces  means  that  obtained  by  pasting  them 
together  at  their  distinguished  point. 

Note  the  following  “boundary-value  problem”:  Does  there  exist  an  orbit 
with  its  a-limit  set  in  Mk+1  and  its  w-limit  set  in  Mk  ? Suppose  the  index 
of  M is  not  the  sum  as  above;  then  the  answer  is  yes.  Namely,  there  must 
then  be  an  orbit  in  M that  is  not  in  Mk+X  u Mk.  That  means  it  lies  in 
Bk , i\Bk.  i but  is  not  contained  in  Bk+  x\Bk  or  Bk\Bk_  t (cf.  Fig.  4).  It  follows 
that  its  a-limit  set  is  in  Bk  + 1\Bk  so  in  Mk+X,  and  its  co-limit  set  is  in 
Bk  Bk.  , so  in  M*_,. 


+1 


FlU. 


5. 


Allowing  a “localization”  of  the  above  notions  wherein  X is  replaced  by 
a block,  we  give  a more  concrete  example.  Consider  a flow  in  the  plane  that 
admits  a disklike  block  B,  as  depicted  in  Fig.  5.  Suppose  there  are  precisely 
two  rest  points  in  this  block,  one  of  which  is  a repeller.  and  further  suppose 
there  is  a continuous  function  on  the  block  that  is  strictly  decreasing  on 
nonconstant  orbits.  Then  there  must  be  an  orbit  in  the  disk  running  from 
the  repeller  to  the  other  rest  point  (see  Fig.  6). 
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A proof  could  go  as  follows:  Let  x0  be  the  repeller  and  for  small  c,  let 
B,  = {x  e B\g(x)  < g(x0)  — e}.  Because  g is  strictly  decreasing  on  non- 
constant orbits,  B,  is  also  a block  (the  new  boundary  points  are  entering 
points).  Now  points  in  the  boundary  component  of  near  x0  must  have 
x0  as  their  a-limit  set. 

This  boundary  component  separates  x0  from  the  boundary  of  the  disk  B; 
in  particular,  it  is  not  homeomorphic  to  a subset  of  an  arc.  But  the  exit  set 
from  the  disk  B is  an  arc;  this  implies  some  point  in  this  boundary 
component  does  not  leave  B.  Thus  the  a>-limit  set  of  this  point  is  in  the  disk 
and  (again  because  g decreases  on  nonconstant  orbits)  therefore  must  be 
the  other  rest  point.  This  orbit  is  the  solution  of  the  boundary-value 
problem. 

More  in  line  with  the  previous  discussion  of  index,  B,  and  B2  = B are 
blocks  corresponding  to  a “local”  filtration  (of  B).  The  (two)  Morse  sets 
are  the  rest  points.  The  repeller  must  have  nonzero  index  (the  index  is, 
in  fact,  a pointed  two-sphere).  The  index  of  B itself  (more  accurately,  of  the 
maximal  invariant  set  in  B)  isO;  namely,  the  homotopy  type  of  the  pointed 
point,  as  follows,  since  the  set  of  exit  points  of  B is  a strong  deformation 
retract  of  B.  Since  0 cannot  be  a sum  one  of  whose  summands  is  not  0, 
the  result  follows. 

Such  examples  arise  in  the  problem  of  existence  of  structure  for  shock 
wave  solutions  of  conservation  laws.  In  another  example,  one  finds  a (local) 
filtration  where  the  four  Morse  sets  have  indices  Z\  L4. 15,  and  (in  order 
increasing  with  the  Morse  sets)  and  where  the  total  index  is  0.*  Purely 
algebraic  considerations  of  a trivial  sort  imply  the  first  and  second  as  well 
as  the  third  and  fourth  Morse  sets  are  connected.  This  is  because  indices 


Here  I"  means  (he  pointed  n-sphere 
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can  cancel  out  pairwise  only  in  special  ways  the  indices  of  Mk . M.  and 
V/*.!  are  related  by  a coexact  sequence;  if  the  index  of  M is  0,  that  of 
Mk + i is  the  suspension  of  that  of  Mk. 

The  machinery  may  seem  a little  heavy  for  the  examples  (as  it  is,  in  fact). 
By  way  of  mitigation  it  can  be  pointed  out  that  it  works  with  no  additional 
effort  in  the  case  where  the  llow  in  the  disk  (say)  is  weakly  coupled  to  a 
system  ol  the  form  x — Ax,  where  A is  a hyperbolic  matrix.  Here  the 
weakly  coupled  llow  means  a small  (C  0)  perturbation  of  the  product  of  the 
two  flows. 

Another  such  boundary-value  problem  arises  in  the  problem  of  traveling 
wave  solutions  of  the  equation 

»r  = >'**  +/(») 

where  / (u)  and  its  integral  F have  graphs  as  depicted  in  Fig.  7. 


f 


F 


Fig.  7. 


A traveling  wave  is  a solution  that  depends  only  on  x + Of,  where  0 is  a 
free  parameter.  Thus  the  solution  satisfies  an  ordinary  differential  equation, 
which,  as  a system,  looks  like 

u = u,  v = 0v-f{u). 

I his  system  has  three  critical  points  two  of  which  are  hyperbolic  and 
one  a repelling  spiral  node  {()  is  assumed  positive).  The  question  is.  Does 
there  exist  a 0 > 0 such  that  the  two  hyperbolic  points  are  connected  by 
an  orbit? 

The  positive  answer  is  apparent  from  the  phase  portraits  for  small  and 
large  0 (Fig.  X).  The  orbit  with  the  arrowhead  must  hit  the  right-hand  rest 
point  for  some  value  of  0 between  those  pictured. 

In  this  case  the  energy  function  r2/2  + F(u)  is  increasing  on  nonconstant 
orbits  and  provides  a local  filtration  with  three  Morse  sets.  In  order,  these 
are  repelling  node;  M2 , left-hand  hyperbolic  point;  and  M,.  right-hand 
hyperbolic  point.  Now.  in  general,  if  M 2 and  M,  are  not  connected  we 
can  choose  a different  filtration  so  that  they  occur  in  reverse  order  M3, 
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M2\  M\'.  where  M2'  = Mlt  M,'  = M2 . If  this  were  possible  for  all  0 , then 
the  index  of  Af.  corresponding  as  described  in  the  previous  example  to  the 
pair  (M2\  M3),  wouid  always  be  the  same.  For  0 small,  the  node  M3  and 
the  right-hand  hyperbolic  point  M2  are  not  connected,  so  the  index  of  M 
is  the  sum  of  the  indices.  On  the  other  hand  for  0 large  there  is  an  orbit 
running  from  the  node  to  the  right-hand  hyperbolic  point  and  one  can 
compute  that  the  index  of  M is  0 in  this  case  (one  can  find  a block  about 
the  connecting  orbit  that  is  a topological  disk  with  an  arc  for  the  exit  set).  It 
follows  that  the  interchange  of  M2  and  Af,  cannot  be  made  for  all  0 and 
this  implies  M2  is  sometimes  connected  to  M,  by  an  orbit. 

This  “proof”  may  be  too  sketchy  for  those  not  familiar  with  the 
approach;  however,  the  result  is  evident  from  Fig.  8.  The  point  to  be  made 
might  be  illustrated  again  by  the  fact  that  the  above  argument  will  work 
when  the  equations  are  weakly  coupled  to  x = Ax,  where  A is  hyperbolic; 
namely,  the  “algebraic”  conditions  are  still  satisfied. 

There  are  several  other  examples  of  boundary-value  problems  where 
these  methods  prove  useful,  among  which  are  the  problem  of  existence 
of  traveling  waves  of  nonlinear  diffusion  equations  of  the  type  of  Nagumo’s 
or  Hodgekin  and  Huxley’s  equation.  These  examples  are  somewhat  too 
complicated  to  include  here  but  (on  paying  attention  to  “fast”  and  “slow” 
variables)  are  composites  of  the  last-mentioned  example  and  the  first  one 
(for  orbits  crossing  a tube). 

Somewhat  different  qualitative  features  are  illustrated  by  the  classical 
Sturm  Liouville  problem,  a nonlinear  version  of  which  can  be  recognized 
in  the  study  of  the  triple  collision  of  the  three-body  problem. 

The  special  aspect  of  Morse  sets  for  Hows  on  vector  bundles  (namely, 
they  are  subbundles  for  flows  over  chain-recurrent  flows)  has  been  studied 
and  application  made,  for  example,  to  the  study  of  linear  almost  periodic 
differential  equations.  These  ideas  also  relate  to  the  study  of  hyperbolic 
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invariant  sets.  For  example,  a compact  chain-recurrent  invariant  set  of  a 
smooth  How  is  hyperbolic  if  and  only  if  there  are  no  (nontrivial)  bounded 
orbits  of  the  tangent  How  over  the  invariant  set.  This,  together  with  the 
equation  for  Jacobi  fields,  provides  a proof  that  flows  on  compact  surfaces 
of  negative  curvature  are  hyperbolic  (chain  recurrence  is  implied  by  volume 
preserving). 

The  notions  of  chain  recurrence,  filiations,  and  Morse  sets  all  carry  over 
to  semiflows  on  compact  metric  spaces.  An  example  may  indicate  how  this 
might  be  useful.  Consider  the  equation  u,  = An  + /(«),  where  the  Laplacian 
acts  on  a bounded  domain  Q with  self-adjoint  boundary  conditions  and 
/(u).  together  with  its  derivative,  is  bounded.  This  equation  generates  a 
semiflow  on  /?(£}).  Suppose  there  are  three  critical  points  [solutions  of 
Am  + /(m)  = 0]  and  that  the  linearized  equations  have  no  eigenvalues  with 
zero  real  part. 

Let  n j,  m2,  and  n3  be  the  number  of  eigenvalues  with  positive  real  part 
at  each  of  the  three  critical  points. 

The  Schauder  Leray  theory  implies  two  of  the  n,  are  even  and  one  is 
odd.  The  Morse  theory  implies  one  of  them  is  zero  and  the  other  two 
differ  by  1 ; it  also  implies  there  are  nonconstant  solutions  of  the  equation 
that  are  bounded  for  all  time  in  both  time  directions  and  that  connect 
the  critical  points  in  fact,  the  dimension  of  the  set  of  such  orbits  is  at  least 
the  largest  of  the  ni . It  is  easy  to  conjecture  that  that  is  the  exact  dimen- 
sion and  that  in  general,  the  set  of  orbits  that  exist  for  all  time  in  both 
time  directions  is  contained  in  some  finite-dimensional  submanifold  of  L2(Ci). 

Recently,  J.  J.  Levin  has  proven  theorems  concerning  asymptotic  solutions 
of  integral  equations  in  which  the  notions  of  filtrations  and  chain 
recurrence  seem  to  play  a role.  It  seems  to  this  author  that  the  techniques 
of  the  qualitative  theory  of  ordinary  differential  equations  will  find  an 
increasing  number  of  applications  in  part’  '1  and  integral  equations. 

Many  authors  have  studied  the  questions  briefly  described  here.  For 
example,  hyperbolic  sets  and  hyperbolicity  in  more  general  situations  have 
been  studied  by  J.  Sinai,  D.  Anosov,  S.  Smale,  C.  Pugh,  M.  Hirsch,  M.  Shub, 

R.  Bowen.  C.  Robinson,  J.  Robbin,  J.  Moser,  Z.  Nitecki,  J.  Franks, 

S.  Newhouse.  D.  Ornstein,  B.  Weiss,  J.  Palis,  N.  Fenechal,  R.  Sacker,  and 
probably  others  of  whom  this  author  is  not  aware.  At  the  basis  of  the 
present  description  lie  the  central  ideas  of  T.  Wazewski.  The  notion  of 
filtrations  has  been  discussed  by  many  authors  already  listed  above;  Z. 
Nitecki,  M.  Shub,  S.  Smale,  and  J.  Palis  are  singled  out  by  virtue  of  the 
fact  that  the  present  author  knows  of  their  work  specifically. 
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The  notion  of  blocks  has  been  discussed  by  R.  Easton.  D.  Rod.  R.  Churchill. 
J.  T.  Montgomery.  J.  Selgrade,  F.  Wilson.  J.  Yorke.  E.  Thomas.  J.  Smoller, 
and  the  present  author. 

Chain  recurrence  and  pseudo-orbits  are  discussed  by  J.  Sinai.  R.  Bowen. 
J.  Selgrade.  and  this  author. 

Almost  all  the  things  alluded  to  here  can  be  found  in  Mathematical  Reviews 
or  in  W.  Gottschalk’s,  “Bibliography  for  Dynamical  Topology"  (Wesleyan 
University).  Three  items  that  have  not  yet  appeared  in  print  are  R.  McGehee's 
treatment  of  the  triple  collision  in  the  three-body  problem  (Irwenliones 
Math.),  G.  Carpenter’s  treatment  of  the  Hodgkin  Huxley  equation,  and 
J.  Levin’s  paper  on  Integrodifferential  equations  (Advances  in  Mathematics). 
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The  Lefschetz  Fixed-Point  Formula; 
Smoothness  and  Stability 


MICHAEL  SHUB 
Department  of  Mathematics 
Queens  College,  Flushing,  New  York 


It  is  a great  honor  and  pleasure  for  me  to  be  speaking  at  the  formal 
occasion  for  dedicating  the  Brown  University  Center  for  Dynamical  Systems 
to  the  memory  of  Solomon  Lefschetz.  Lefschetz  is  surely  one  of  the  towering 
figures  of  mathematics  in  our  century,  and  while  he  is  mainly  thought  of 
for  his  contributions  to  algebraic  geometry  and  topology,  his  contributions 
to  dynamical  systems  have  been  great.  Let  me  mention  just  two  that  are 
of  special  interest  to  me  and  that  I shall  be  talking  about  today.  First, 
there  is  the  Lefschetz  fixed-point  formula.  While  in  some  sense  this  formula 
has  become  part  of  topology,  topology  used  to  be  called  analysis  situs.  As 
Dennis  Sullivan  put  it  to  me  one  day:  “Topology  is  the  subject;  analysis 
is  the  object.”  This,  of  course,  is  a major  theme  from  Poincare  to  the 
present.  Lefschetz  never  lost  sight  of  it.  With  respect  to  the  fixed-point 
formula  one  need  only  read  Lefschetz’s  discussion  of  it  in  the  preface  to 
[13]  or  in  the  introduction  to  [ 14].  In  the  latter,  Lefschetz  refers  to  Poincare’s 
interest  in  fixed  points,  which  came  out  of  dynamics.  In  case  a vector 
field  X has  a manifold  of  section  M,  the  periodic  solutions  of  X correspond 
to  the  periodic  points  of  the  first  return  map  on  M.  This  is,  of  course, 
but  one  simple  application  of  this  very  general  and  powerful  formula. 
Second,  I would  like  to  mention  structural  stability.  Structural  stability 
was  defined  by  Andronov  and  Pontryagin  in  1937  [2]  and  they  stated 
results  on  structural  stability  in  two  dimensions.  While  1 know  of  no  work 
of  Lefschetz  himself  on  structural  stability  we  can  find  his  influence  quite 
clearly  stated  in  the  works  of  DeBaggis  [6]  and  Peixoto  [22],  where  some 
of  the  fundamental  theorems  of  structural  stability  were  first  proven. 


I.  The  Lefschetz  Fixed-Point  Formula  and  Smoothness 

The  Lefschetz  fixed-point  formula  is  known  for  quite  general  spaces  (e  g., 
finite  simplicial  complexes)  but  I will  restrict  my  discussion  here  to  compact 
differentiable  manifolds  without  boundary. 
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If  0 is  an  isolated  fixed  point  of  the  continuous  map  /:  U -*  Rm,  where 
U is  an  open  subset  of  Rm,  then  the  index  of  / at  0.  n f( 0),  is  the  local 
degree  of  the  mapping  Id  - / restricted  to  an  appropriately  small  open  set 
about  0.  If  0 is  an  isolated  fixed  point  of  /'",  then  ar(0)  is  defined  for  all  n. 
If  /:  M -*  M,  f"(p)  = p.  and  p is  an  isolated  fixed  point  for/",  then  or(p) 
is  defined  by  taking  local  coordinates  around  p.  The  Lefschetz  numbers 
L(f)  are  defined  by  the  formula 

L(fn)  = X (~  >)'  tr  /;,:  //,(.W)-//,(M). 

where  //,(M)  is  the  ith  homology  group  of  M with  rational  coefficients 
and/*,  : H,(M)  -■*  H ,(M)  the  map  induced  by /on  these  groups.  (/")*,  = (/*,)". 
so  it  is  not  too  difficult  to  compute  the  L(  /")  once  the  eigenvalues  of  the 
/*,  are  known. 

The  Lefschetz  fixed-point  formula  says  that  the  L()n)  can  be  computed 
in  terms  of  the  fixed-point  indices. 

(1.1)  Lefschetz  fixed-point  theorem 

un=  z °ap) 

p f.  Fix  / * 

provided  that  the  fixed  points  of  /"  are  isolated. 

Of  course,  this  theorem  has  the  immediate  corollary  that  / has  a fixed 
point  if  HI)*  0;  but  it  says  much  more.  Let  me  give  an  example  of  the 
type  that  I -.hall  be  pursuing.  Let  0 < r < x and  Er(M)  denote  the  space 
of  C'  endomorphisms  /:  M M with  the  Cr-topoIogy.  Er(M ) is  a complete 
metric  sp..ce,  and  hence  the  Baire  category  theorem  applies.  A subset 
X E'{M ) is  said  to  be  generic  if  it  contains  the  countable  intersection 
of  open  and  dense  sets. 

Bv  the  Baire  category  theorem  a generic  set  in  Er(M)  is  dense.  If  a property 
is  true  for  a generic  subset  of  E'(M)  we  say  that  it  is  generic  or  that  the 
generic  / has  that  property.  The  simplest  part  of  the  Kupka  Smale  theorem 
(see  [27],  for  example)  shows  that  the  periodic  points  of  period  /i  of  the 
generic  / (i.e..  the  fixed  points  of  /")  are  transversal,  hence  isolated,  and 
the  index  of  any  periodic  point  of  generic  / is  either  + 1 or  —1  Let  N„(f) 
be  the  number  of  period  points  of  period  n of  /. 

(1.2)  Proposition.  For  the  generic  / in  Er(M). 

*V„(/)  > |L(/")|. 

This  is  an  immediate  consequence  of  the  Lefschetz  fixed-point  formula. 
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Example  1.  For  the  generic  C',  J:  Sm  -» S’", 

NJf)  < | deg  (/)"  ± 1 1 

where  the  sign  is  positive  if  m is  even  and  negative  if  m is  odd. 

These  formulas  are  valid  for  all  n and  thus  we  can  try  to  summarize  the 
information  contained  in  them  by  a single  number  that  measures  the 
asymptotic  exponential  growth  rate  of  the  NJ  / ). 


(1.3)  Proposition.  For  the  generic  / in  Er(M), 

lim  sup  ( I n)  log  NJ  / ) ^ lim  sup  ( 1 /n)  log  |L(/")|. 

Another  way  to  express  the  number  lim  sup  (\/n)  log  NJ  / ) is  that  it  is 
the  reciprocal  of  the  radius  of  convergence  of  the  Artin  Ma/ur  /eta  function 


;,(/)  = exp 


/ 

I NJi)rm  . 


Neither  Proposition  I nor  2 holds  for  all  continuous,  Lipschitz.  or  even 
piecewise  linear  mappings. 


Example  2.  Let  /:  .S'2  -♦  S2  be  the  map  of  the  Riemann  sphere  defined  by 
r -*  2:2  z Then  deg  ( / ) = 2.  but  / has  only  two  periodic  points  0 and  / 
So  NJ  I ) = 2 for  all  n.  whereas  /.(/")  = 2"  4-  I This  / is  Lipschitz  but  not 
piecewise  linear.  There  are  piecewise  linear  homeomorphisms  of  manifolds 
even  that  do  not  satisfy  this  property.  The  Lefschetz  formula  is  valid,  of 
course,  in  the  case  nr(0)  4 nr(  / ) = 2"  4-  I It  is  not  difficult  to  calculate 
that  ar( 0)  = 2”. 

Let  me  digress  for  a moment  to  emphasize  the  power  of  formulas  like 
Proposition  (1.3)  In  principal  if  we  are  to  know  or  even  to  estimate  the 
growth  rate  of  the  number  of  periodic  points  of  / of  period  n for  all  n,  we 
must  know  / precisely  and  we  must  iterate  it  infinitely  often.  On  the  other 
hand,  lim  sup  (I  n)  log  ;/.(/")(  can  be  calculated.  If  we  know  the  manifold 
V/  and  if  we  know  / even  approximately,  then  by  simplicial  approximation 
the  eigenvalues  of  the  /„,  can  be  calculated,  and  lim  sup  (I  n)  log  \L(fn)\ 
can  be  determined  If.  for  example,  the  /„,  have  a unique  eigenvalue  of 
maximal  modulus.  /.,  then  lim  sup  (1  n)  log  )L(  /")|  = log  |z.|.The  total  effect 
is  to  replace  a nonlinear  problem  involving  infinite  iterations  by  a finite 
combinatorial  problem  and  to  make  it  linear. 
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Example  2 shows  that  Propositions  (1.2)  and  (1.3)  fail  for  C°  endcmor- 
phisms  of  manifolds.  In  fact,  there  are  only  two  periodic  points,  while  the 
Lefschetz  numbers  tend  to  infinity.  But  surprisingly  enough  it  is  not  known 
if  Proposition  (1.3)  is  true  for  smooth  maps. 

Problem  1 [34].  Let  f:  M —>  M be  smooth.  Is  lim  sup  (1/n)  log  N„(/)  > 

lim  sup  (1/n)  log  |L(/")|? 

It  is  only  recently  that  effects  of  smoothness  on  the  Lefschetz  formula 
have  begun  to  be  studied. 

(1.4)  Proposition  [34].  Suppose  f:U->Rm  is  Cl  and  that  0 is  an 
isolated  fixed  point  of f"  for  all  n.  Then  oy»( 0)  is  bounded  as  a function  of  n. 

The  Lefschetz  trace  formula  now  tells  us: 

(1.5)  Corollary  [34].  If/:  M ->  M is  C1  and  the  Lefschetz  numbers 
L(J")  are  not  bounded  then  the  set  of  periodic  points  of  /is  infinite. 

So  the  phenomenon  of  Example  2 cannot  occur  for  C1  mappings.  Any 
C1  mapping/:  S2  -*•  S2  with  degree  2 has  infinitely  many  periodic  points. 

(1.5')  Corollary.  Suppose  the  C1  vector  field  X on  N has  a C1  manifold 
of  section  M,  with  first  return  map  /:  M ->  M.  If  the  Lefschetz  numbers 
L(f")  are  unbounded,  then  X has  infinitely  many  periodic  solutions. 

Example  3.  To  illustrate  Corollaries  ( 1 .5)  and  ( 1 .5  ) we  consider  what  has 
come  to  be  called  the  Thom  diffeomorphism  of  the  two-dimensional  torus 
T2.  Think  of  T2  as  R2  mod  the  integer  lattice  Z2  cr  R2.  T2  = R2/Z2.  The 
matrix  A = (f[)  has  integer  entries  and  determinant  one,  so  A defines  a 
map  of  R2/Z 2 that  is  invertible,  i.e.,  a diffeomorphism.  Denote  the  map  of 
T2  it  defines  by  A again. 

A:  T2  -»  T2.  The  homology  groups  of  T2  with  real  coefficients  are 
H0(T2,  R)  = R,  //[(T2,  R)  = R2,  H2(T2,R)  = R. 

A0 * and  A are  the  identity  transformations  and  A1j¥:  R2  R2  is  just  A 
itself.  Thus  the  Lefschetz  numbers  L(A")  = 2 — tr  A".  The  eigenvalues  of  A 
are  (3  ± y/5)/2.  So 

(1)  The  L(A")  are  unbounded;  in  fact, 

(2)  lim  sup  (1/n)  log  \L(An)\  = log  [(3  + Js)/!]. 
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Consequently: 

(a)  Any  C1/ homotopic  to  A has  infinitely  many  periodic  points. 

(b)  Any  C1  vector  field  X that  has  a C1,  T2  manifold  of  section  with 
first  return  map  homotopic  to  A has  infinitely  many  periodic  solutions. 

(c)  The  generic  / in  Er(T2)  that  is  homotopic  to  A has 

lim  sup  (1/m)  log  N„(f)  > log  [(3  + sj5)/2\ 

so  the  number  of  periodic  points  of / of  period  n is  growing  exponentially 
with  n. 


II.  Structural  Stability 

The  theory  of  structural  stability  (and  0-stability)  for  M-dimensional 
manifolds  has  made  remarkable  progress  in  the  last  fifteen  years.  We  recall 
these  concepts  briefly. 

(2.1)  Definition.  If  X and  Y are  topological  spaces,  /:  X -» X and 
g:  Y ->  Y are  continuous,  then/ and  g are  topologically  conjugate  iff  there 
is  a surjective  homeomorphism  h:  X -+  Y such  that  gh  = hf 

(2.2)  Definition.  IfX  is  a topological  space  and/:  X->X  is  continuous, 
the  nonwandering  set  of  / 0(/)  = {ij/  e X\  given  any  neighborhood  U ^ in 
X there  is  an  m > 0 such  that  /"(U^)  n U*  * 0}- 

O (/)  is  closed,  f(Q(  f))  <=  fi(/),  and  if/ is  a homeomorphism  f(Q(  f))  = 
Q(/)  and  fi(/)  = fi(/“  ').  Q(/)  contains  all  the  periodic  points  of/and  all 
the  ro-limit  points  of/  So  as  Smale  has  said  fi(/)  is  where  all  the  action  is. 

(2.3)  Definition,  (a)  / e DifTr  (Af)  is  structurally  stable  if  there  is  a 
neighborhood  Vf  of  / in  Diffr  (M)  such  that  any  g e Vf  is  topologically 
conjugate  to/ 

(b)  fe  Diffr  (M)  is  Q-stable  if  there  is  a neighborhood  Vf  of/  in  Diffr  ( M ) 
such  that  for  any  g e Vf,  and  g/Cl(g)  are  topologically  conjugate. 

Any  structurally  stable  diffeomorphism  is  fi-stable,  but  there  are  f2-stable 
diffeomorphisms  that  are  not  structurally  stable.  This  was  proven  by  Smale, 
which  is  why  he  defined  O-stability.  For  much  more  discussion  of  structural 
and  O-stability  and  examples  of  these  phenomena  one  should  read  Smale’s 
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original  survey  article  on  the  subject  [39],  and  one  should  consult  Nitecki’s 
book  [18]  as  well  as  the  further  survey  articles  [29,  40,  42],  Lefschetz  has 
perhaps  made  the  strongest  claim  for  the  importance  of  structural  stability 
itself  [15.  p.  250]: 

In  a system  of  differential  equations  arising  out  of  a practical  problem 
the  coefficients  are  only  known  approximately:  various  errors  are 
inevitably  involved  in  their  determination.  The  phase  portrait  of  the 
system  must  therefore  be  such  that  it  is  not  affected  by  small  modifica- 
tions in  the  coefficients.  In  other  words  it  must  be  “stable”  under 
these  conditions. 

Lefschetz  was,  of  course,  talking  about  the  definition  of  structural  stability 
for  differential  equations,  but  the  same  may  be  said  for  diffeomorphisms. 
Insisting  that  the  systems  must  be  stable  in  the  sense  of  structurally  stable 
is  perhaps  extreme.  Other  authors,  for  example  Thom,  content  themselves 
with  less.  Lefschetz  was  perhaps  influenced  by  the  situation  for  two  manifolds 
that  he  was  talking  about,  but  in  any  case  his  statement  gives  the  flavor 
of  the  importance  of  structurally  stable  systems,  and  much  the  same  can  be 
said  for  Q-stable  systems.  Moreover,  while  it  is  to  be  expected  that  structurally 
stable  diffeomorphisms  exhibit  the  generic  behavior  of  nearby  systems,  they 
will  not  have  topological  properties  that  are  not  generic.  Thus  structurally 
stable  diffeomorphisms  avoid  pathological  behavior  and  should  be  fairly 
simple  to  describe.  Since  Diffr  (M)  is  separable,  there  are  only  countably 
many  structurally  stable  systems  up  to  topological  conjugacy,  and  one  might 
hope  to  find  invariants  that  give  a fairly  accurate  picture  of  the  orbit 
structure  of  structurally  stable  systems. 

To  give  a brief  example,  if  fe  Diffr  (M)  is  Q-stable  then  Nn(J' ) = N„(g) 
for  all  n and  all  g in  some  Cr  neighborhood  of  f.  If  we  now  apply 
Proposition  (1.3)  we  have 


(2.4)  Proposition.  If  J'e  Diffr  (M)  is  Q-stable,  then 

lim  sup  (1/n)  log  N„(f)  > lim  sup  (1/n)  log  \L(fn)\. 

Now  we  are  faced  with  an  alternative:  either  there  is  no  diffeomorphism 
f:  T2  -»  T2  that  is  even  homologous  to  the  Thom  diffeomorphism  (see 
Example  3)  and  is  structurally  stable  or  there  are  structurally  stable  diffeo- 
morphisms with  an  infinite  number  of  periodic  points.  This  was  the  motiva- 
tion for  Thom's  example,  and  with  historical  hindsight  the  situation  is 
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clear  enough.  But  at  the  time  it  was  thought  that  the  structurally  stable 
diffeomorphism  might  be  open  and  dense  on  all  manifolds  and  would  have 
only  finitely  many  periodic  points.  The  reason  for  this  is  that  Peixoto  had 
proven  the  openness  and  density  of  structurally  stable  difTeomorphisms  in 
DifP  (S')  and  had  shown  that  they  had  only  finitely  many  periodic  points. 

Actually,  Smale  constructed  a structurally  stable  diffeomorphism  of  S2 
with  infinitely  many  periodic  points  [37],  and  Anosov  [3]  later  proved  that 
the  Thom  diffeomorphism  was  structurally  stable.  The  difference  between 
Sl,  S2,  and  T 2 can  be  seen  in  the  following  remark.  If  f:  Sm  -*  Sm  is  a 
diffeomorphism,  then  | L(fn)\  = 0 or  2 for  all  n. 

Thus  Thom’s  example  shows  that  infinitely  many  periodic  points  can  be 
forced  for  homological  reasons,  because  of  the  Lefschetz  fixed-point  formula. 
Smale’s  example  shows  that  they  are  not  pathological  and  that  they  occur 
in  structurally  stable  difTeomorphisms  for  local  as  well  as  global  topological 
reasons.  I have  dwelt  on  the  Thom  diffeomorphism  because  I will  investigate 
below  further  homological  restrictions  on  the  growth  rate  of  the  number  of 
periodic  points  of  the  O-stable  difTeomorphisms  that  are  understood.  But 
first.  I have  to  present  the  picture  that  evolved  through  the  work  of  Smale 
and  others  describing  O-stable  and  structurally  stable  difTeomorphisms. 

(2.5)  Axiom  A [39].  / g Diff'  ( M ) satisfies  Smale’s  Axiom  A if  and 
only  if 

(a)  0(  / ) has  a hyperbolic  structure, 

(b)  0(  /)  is  the  closure  of  the  periodic  points  off. 

That  0(  / ) has  a hyperbolic  structure  means  that  TM  |0(/).  the  tangent 
bundle  of  M restricted  to  O (/),  may  be  written  as  the  direct  sum  of  two 
Tf  invariant  subbundles  £"  ® £u  such  that  there  exist  constants  0 < / < 1, 
0 < c,  and 

Tfn\E'  <cv."  for  n > 0, 

|7’/"|£u  <cv."  for  n < 0. 

When  / satisfies  Axiom  A.  Smale  proved  the  existence  of  a “spectral 
decomposition"  for  Q (/). 

(2.6)  Theorem  [39].  If/  e Diffr  (M)  satisfies  Axiom  A,  then  ft(/)  is 
the  disjoint  union  fl(/)  = Q,  u •••  u fik . where  each  Q,  is  closed  and  invari- 
ant for/and  / |Qf  is  topologically  transitive. 
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(2.7)  Corollary  [39].  If/:  M M is  as  above,  then  M is  the  disjoint 
union  of 

(a)  M = u WW,) 

i=  1 

where 

W/S(£2I)  = {i J/ e M|/m((/i)-»Q,  as  m -*  x}. 

(b)  M = U W/U(Q1) 

i = 1 

where 

W^Q,)  = {t/i  e M|  /m(t/i)  -»fiasm-*  -oo}. 

Under  the  circumstances  we  can  define  Q , > if  (VT“(t2,)  - Q,-)  n 

(W'(ft;)  - J2,)  / 0.  ./  is  said  to  have  no  cycles  if  Q,o  > £2,,  > > ft,  = O,o 

is  impossible  for  any  / > 1.  Actually  it  is  always  impossible  for  f2,  > for 
an  Axiom  A difieomorphism  as  Smale  knew,  so  we  can  say  j > 2 if  we  want. 

We  can  finally  state  Smale s Q stability  theorem: 

(2.8)  Theorem  (Smale  [41]).  If/ satisfies  Axiom  A and  has  no  cycles 
then  / is  i2-stable. 

The  converse  to  this  theorem,  which  was  conjectured  by  Smale,  is  one  of 
the  outstanding  problems  of  dynamical  systems. 

(2.9)  Conjecture  (Smale  [41 , 42]).  / is  il-stable  ifT/ satisfies  Axiom  A 
and  the  no-cycle  property. 

It  is  known  that 

(2.10)  Theorem  (Palis  [20]).  If  / satisfies  Axiom  A and  is  i2-stable 
then  / has  no  cycles. 

So  the  problem  really  is:  Does  O-stability  imply  Axiom  A?  There  are 
some  partial  results  here  by  Franks  [7.  8],  Guckenheimer  [9],  and  more 
recently  Mane  [16],  but  the  problem  is  still  open.  Moreover,  all  these 
approaches  are  strictly  C1  because  they  rely  on  Pugh's  general  density 
theorem,  which  states: 
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(2.1 1)  Theorem  (Pugh  [24]).  For  the  generic  / in  Diff1  (A/).  Q(/)  = 
Per  (/)• 

So  anything  that  is  Q-stable  in  DifT 1 (M)  already  satisfies  Axiom  A(b). 

To  know  whether  Pugh's  closing  lemma  or  general  density  theorem  holds 
in  DifT  (M).  r > 1.  is  absolutely  crucial  for  our  thinking.  Personally.  I find 
the  C1  proof  so  difficult  already  that  I frequently  despair  and  try  to  find 
counterexamples. 

Turning  our  attention  to  structural  stability  we  define  the  strong  trans- 
versality  condition  as  in  Smale  [40].  If f e I)ifTr  (M)  and  ip  e M. 

W'W)  = !>■  e M \d(Jn(ip),  /"(>-))  - 0 as  n - ao} 
WuW)={yeM\dUnm  fn(y))  -» 0 as  n->  -oo}. 

If  / satisfies  Axiom  A.  it  follows  [10.  39]  that  W'(ip)  and  Wu(\p)  are  1:1 
immersed  Euclidean  spaces  for  all  ip  € M. 

(2.12)  Strong  transversality  condition.  If/ satisfies  Axiom  A then  / 
satisfies  the  strong  transversality  condition  iff  W'(ip)  and  M/U(i//)  are  trans- 
versal for  all  (p  e M. 

(2.13)  Definition.  / e Diffr  (.Vf)  is  Morse  Smale  iff: 

(1)  Q(/)  is  finite. 

(2)  / satisfies  Axiom  A and  the  strong  transversality  condition. 

That  il(1)  is  finite  means  that  it  consists  of  a finite  number  of  period 
points. 

The  first  structural  stability  theorem  for  diffeomorphisms  was  Peixoto’s. 

(2.14)  Theorem  (Peixoto  [23]).  The  structurally  stable  diffeomor- 
phisms  are  open  and  dense  in  DifT  (S')  and  are  the  Morse  Smale 
diffeomorphisms. 

Here  the  language  has  changed  but  the  theorem  remains  the  same.  This 
theorem  marked  a revival  in  the  subject.  It  motivated  the  following 
problems: 

(1)  Find  open  and  dense  or  generic  properties  in  DifT  (M). 

(2)  Are  the  Morse  Smale  diffeomorphisms  open  and  dense  and  structur- 
ally stable  on  all  manifolds? 
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In  response  to  (1)  there  are  the  Kupka  Smale  theorem  [12.  36]  and 
Pugh's  general  density  theorem,  as  well  as  some  conjectures  (see  [32].  for 
example). 

In  response  to  (2)  we  have  seen  that  the  Morse  Smale  difTeomorphisms 
are  not  open  and  dense  by  Smale’s  examples  and  the  Lefschetz  formula 
argument.  This  still  left: 

(2a)  Are  the  Morse  Smale  difTeomorphisms  structurally  stable? 

(2b)  Are  the  structurally  stable  diffeomorphisms  open  and  dense? 

In  response  to  (2a)  there  are  the  theorems  of  Palis  [19]  and  Palis  Smale 
[21].  which  we  may  state  as: 

(2.15)  Theorem  (Palis-Smale).  The  Morse  Smale  difTeomorphisms 
are  precisely  the  structurally  stable  difTeomorphisms  with  a finite  Q. 

And  because  of  this  theorem  and  the  ^-stability  theorem,  the  conjectures 
of  Smale  [40]  and  Palis  Smale  [21]: 

(2.16)  Conjecture  (Palis-Smale).  f e Diffr(M)  is  structurally  stable 
iff  / is  Axiom  A and  satisfies  the  strong  transversality  condition. 

Robbin  [25]  proved  half  of  this  conjecture  for  / at  least  C2.  and 
Robinson  [26]  relaxed  the  hypothesis  to  C1. 

(2.17)  Theorem  (Robbin-Robinson).  If  / is  Axiom  A and  satisfies 
the  strong  transversality  condition,  then /is  structurally  stable. 

Once  again  one  can  prove: 

(2.18)  Theorem  [40].  If  / is  structurally  stable  and  satisfies  Axiom  A. 
then  / satisfies  the  strong  transversality  condition. 

So  the  real  problem  is:  Does  structural  stability  imply  Axiom  A? 

Of  course,  if  O-stability  implies  Axiom  A so  does  structural  stability.  It 
is  almost  unthinkable  that  one  would  and  the  other  would  not. 

As  far  as  (2b)  goes,  Smale  [3K]  showed  in  1965  that  structurally  stable 
difTeomorphisms  were  not,  in  general,  dense  in  DifTr  (M)  with  the  Cr 
topology.  He  invented  the  notion  of  O-stability  and.  in  196S.  Abraham 
and  Smale  [1]  showed  that  Q-stable  difTeomorphisms  are  also  not  dense 
in  the  C'  topology.  There  are  still  a few  concepts  of  stability  that  might 
prove  dense,  such  as  topological  O-stability  [ 1 1 ] or  the  topological  structural 
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stability  of  attractors  of  Thom  Both  of  these  concepts  include  diffeomor- 
phisms  that  are  not  Q-stable.  But  it  is  certainly  conceivable  that  a much 
weaker  notion  of  stability  would  be  required  tor  density. 

As  most  of  the  promising  approaches  to  the  problem  of  whether 
ii-stabihty  implies  Axiom  A rely  on  the  periodic  points,  it  seems  reasonable 
to  pose  a seemingly  more  general  problem. 

Problem  t.  Let  J e Diff1  (M).  Suppose  that  there  is  a neighborhood  i f 
of ) in  Diff1  (M)  such  that  Nn(g)  = N„(J ) for  all  g e U and  all  n.  Does  / 
satisfy  Axiom  A'.’ 

Q-stuble  diffeomorphisms  have  this  property.  And  the  Axiom  A diffeo- 
morphisms  that  have  this  property  are  Q-stable  by  Palis’s  result  [20],  A 
diffeomorphism  with  this  property  might  reasonably  be  called  zeia-funclion 
stable.  So  we  are  asking:  Is  /eta-function  stability Axiom  A no  cycles^ 
Q-stability? 


III.  The  Lefschetz  Fixed-Point  Formula  and  Stability 

If  / is  an  Axiom  A diffeomorphism.  Bowen  [4]  has  proved  that 
/>(/)  = lim  sup  (1  n ) log  N„(  I ). 

where  hit)  is  the  topological  entropy  of  /.  I will  not  define  topological 
entropy  here  but  will  simply  use  / 1(  / ) for  lim  sup  (1  n)  log  Nn(J  ) when  / 
satisfies  Axiom  A. 

I If:  M — .V/  is  continuous,  we  can  define  s(  /*)  to  be  the  spectral  radius 
of  /*:  HJX1.  R)  ->  //*(M,  R),  that  is,  s(  /*)  = max  | A | , where  the  max  is 
taken  over  all  eigenvalues  of  /*,  and  all  i.  It  is  not  difficult  to  see  that 

(I)  log  s(  /*)  = lim  sup  ( 1/m)  log  |£  tr  |. 

(II)  Let  /( / ) = lim  sup  (1  In)  log  |^  ( - 1)'  tr  ,/*,|. 

log  s (/*)  > HI)  because  there  is  no  alternation  of  signs  in  (I).  whereas 
there  is  in  (II). 

Example  4.  Let/:  M ->  M,  and  let  0:  S'  — S'  be  an  irrational  rotation. 
Let  g:  M x S'  -»  M x .S’1  be/  x I).  Thus  by  the  Runneth  formula.  Tr  g *<  = 
tr/*,  + tr /*<_!•  Because  of  the  alternations  of  signs  in  (II), 
V ( - | )'  tr  </*,  = 0 for  all  n and  1(g)  = log  (0)  = - <x.  The  eigenvalues  of  the 


24 


MICHAEL  SHI  B 


</,,  arc  the  same  as  the  eigenvalues  of  the  so  s(/,)  = s(ym)  and 
log  s(JJ  - log  s(qj.  If  /:  T2  -»  7' 2 were  the  Thom  diffeomorphism  then 
t/:  T’  — T\  l(tf)  = -oo,  and  log  s(qj  = log  [(3  + y/5)/2]. 

We  can  restate  Proposition  2.4  as 

(3.1)  Proposition.  If  (ye  Diffr  (A/)  is  zeta-function  stable,  then 

lim  sup  ( 1/m ) log  iV„(y)  > /(ty). 

But.  in  fact,  one  should  be  able  to  do  much  better: 

(3.2)  Conjecture.  If  q e DifT 1 (M)  is  zeta-function  stable,  then 

lim  sup  (1/n)  log  Nn(q)  > log  s(qj. 

It  is  difficult  to  see  how  to  attack  this  conjecture  without  the  tools  of 
Axiom  A and  no  cycles,  but  perhaps  some  sort  of  algebraic  approximation 
arguments  would  do  the  trick.  Bob  Williams  and  I have  recently  proved  this 
conjecture  for  Axiom  A and  no-cycle  diffeomorphism  by  proving  the 
following,  which  was  conjectured  in  [31]  and  [33]. 

(3.3)  Theorem  [35].  If/e  Diffr  (M)  satisfies  Axiom  A and  has  no 
cycles,  then 

h(f)  > ^g  .%•(/,). 

This  theorem  was  previously  known  for  Morse  Smale  diffeomorphism 
[2X,  33]  and  O-dimensional  Axiom  A and  no-cycle  diffeomorphisms  [5], 

Let  us  give  a corollary  that  includes  the  case  of  Morse  Smale  diffeo- 
morphisms. 

(3.4)  Corollary.  Suppose  / e Dili1  (M)  is  zeta-function  stable  and  has 
only  finitely  many  periodic  points;  then  every  eigenvalue  of f0:  //^(M,  R)  -* 
H0(M,  R)  is  a root  of  unity. 

To  see  this  corollary  we  note  that  each  periodic  point  of  / must  be 
hyperbolic.  By  Pugh’s  theorem  a nearby  <y  will  have  its  nonwandering 
set  equal  to  the  set  of  periodic  points,  so  it  will  satisfy  Axiom  A.  By 
Palis's  argument  q will  have  no  cycles.  By  the  theorem  h(q)  > log  s(q^)  = 
log  s(JJ  and  h[q)  — 0.  So  every  eigenvalue  of/,  has  modulus  1.  As  the  /„ 
are  integral  matrices,  every  eigenvalue  is  a root  of  unity. 
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If  we  return  to  Example  4 for  a moment  we  see  that  <y : 7'3  -*  T3  has  no 
periodic  points  at  all  and  this  corresponds  precisely  to  L(q")  = 0 for  all  n 
in  the  Lefschetz  trace  formula.  But  any  Axiom  A and  no-cycle  / that  is 
even  homologous  to  q must  have  infinitely  many  periodic  points;  in  fact, 
h(f)  > log  [(3  + v7  5)/2].  So  we  naturally  have  the  question:  Is  q isotopic 
to  an  Axiom  A and  no-cycle  difTeomorphism?  Smale  answered  this  question 
in  1971. 

(3.5)  Theorem  (Smale  [43]).  Any  / e DiflT  (M)  is  isotopic  to  an 
Axiom  A and  no-cycle  difTeomorphism  and  hence  an  i2-stable  difTeo- 
morphism. 

This  theorem  was  generalized  in  [30]  with  all  the  details  carried  out  in 
[33]  and  [44], 

(3.6)  Theorem.  Any  / e Diffr(A7)  is  isotopic  to  an  Axiom  A and  strong 
transversality  difTeomorphism  and  hence  a structurally  stable  difTeo- 
morphism. Moreover,  the  isotopy  may  be  chosen  to  be  C°  small,  so  the 
structurally  stable  diffeomorphisms  are  dense  in  DifTr  (M)  with  the  C° 
topology. 

This  theorem  shows  that  there  are  no  topological  obstructions  to 
structural  stability  and  Theorem  (3.3)  shows  that  the  periodic  point  struc- 
ture of  the  Axiom  A and  strong  transversality  diffeomorphisms  (which  are 
the  only  known  structurally  stable  diffeomorphisms)  will  be  very  rich.  In 
many  cases  the  growth  rate  of  N„  will  be  exponential,  whereas  the  Lefschetz 
formula  predicts  no  periodic  points  at  all.  Naturally,  then  one  is  led  to 
ask  if  log  v(  /*)  is  the  best  lower  bound  one  can  find  for  h(  f)  in  terms  of  the 
homology  theory  of/  when  / is  Axiom  A and  no-cycles.  This  problem  in 
terms  of  simplest  diffeomorphisms  is  discussed  at  some  length  in  [31]  and 
[33].  Here  I will  restrict  myself  to  the  Morse  Smale  case,  because  the 
distinction  between  finite  and  infinite  periodic  points  is  the  sharpest,  and 
the  Morse  Smale  diffeomorphisms  are  the  simplest.  Here  there  is  a partial 
converse  to  Corollary  (3.4). 

(3.7)  Theorem  [33].  Letdim  M > 6and7t,(M)  = 0. If/ e DifF(M)and 
every  eigenvalue  of/*  is  a root  of  unity  [that  is,  log  ,v(  /*)  = 0],  then  there  is 
an  integer  n > 0 such  that/"  is  isotopic  to  a Morse  Smale  difTeomorphism. 

A precise  condition  for  / itself  to  be  isotopic  to  a Morse  Smale  difTeo- 
morphism is  given  in  [33],  but  is  interesting  that  n cannot  always  be  chosen 
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equal  to  1 in  the  theorem.  There  is  a further  obstruction  related  to  the 
structure  of  the  ideals  in  the  ring  of  integers  of  the  cyclotomic  extensions 
of  the  rationals.  This  obstruction  is  still  not  well  understood  (see  [33]).  but 
it  is  finite. 

So  we  are  getting  closer  to  understanding  how  the  homological  informa- 
tion about  / gives  lower  bounds  for  h(f)  for  any  structurally  stable  f and 
these  lower  bounds  are  sharper  than  the  Lefschetz  fixed-point  formula  will 
give  us.  Lower  bounds  are  all  that  is  possible.  Smale’s  examples  can  be 
adapted  to  show  that  if  dim  M > 2 , /e  DifTr  (M).  and  k > 0,  then  there  is 
an  Axiom  A and  strong  transversality  difTeomorphism  y that  is  isotopic  to 
/ and  h(y)  > k. 

Let  me  close  with  a small  discussion  of  a possible  generalization  of 
Theorem  (3.3)  in  terms  of  topological  entropy,  which  would  give  informa- 
tion about  the  orbit  structure  of  every  smooth  map  in  terms  of  homological 
information.  Here  h(f)  will  stand  for  the  topological  entropy  of  any 
continuous  function;  for  definitions  and  a further  discussion  see  [31], 

(3.8)  Conjecture  [31].  Let/;  M -»  M be  a smooth  map  (or  difTeo- 
morphism); then  h(f)  > log  s(jm). 

Here  there  is  some  information  known.  Example  2 shows  that  the  con- 
jecture fails  for  continuous  maps;  in  fact,  it  fails  for  some  piecewise  linear 
homeomorphisms  of  manifolds,  so  smoothness  is  essential.  We  do  know, 
however,  as  a special  case  of  work  of  Manning  that: 

(3.9)  Theorem  (Manning  [17]).  Let  /:  M -*  M be  continuous.  Then 
h(f)  > log  *(./*,),  where/*,:  //*,(M.  K)->//*,(M,  R). 

(3.10)  Corollary  (Manning  [17]).  Let  f:  M -»  M be  a homeomor- 
phism,  and  let  dim  M < 3.  Then  /i(  / ) > log  s(fj. 
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Chapter  2 : GENERAL  THEORY 


Nonlinear  Oscillations  in  the  Frame  of  Alternative 
Methods* 

LAMBERTO  CESARI 

Department  of  Mathematics 

University  of  Michigan.  Ann  Arbor.  Michigan 


1.  Introduction 

The  injection  of  methods  of  functional  analysis  (Cesari  [5.  6],  1963)  in  the 
classical  bifurcation  process  of  Poincare  [33],  Lyapunov  [24],  and  Schmidt 
[35]  and  extensive  subsequent  work  have  made  this  process  a fine  tool  in 
nonlinear  analysis,  particularly  in  the  difficult  problems  “at  resonance"  in  the 
usual  terminology.  The  general  theory  that  has  ensued,  with  all  its  variants 
and  ramifications,  is  often  referred  to  as  bifurcation  theory,  or  alternative 
methods. 

A great  many  ideas  have  been  brought  to  bear  in  this  theory  in  the  last 
few  years,  such  as  Schauder’s  fixed  point  theorem  (Cesari  [5],  Landesman 
and  I.a/er  [21  j.  Williams  [38]):  Banach's  fixed  point  theorem  (Cesari  [5.  6] 
and  much  subsequent  work):  invariance  properties  of  topological  degree 
(Cesari  [5].  Knobloch  [20].  Cronin  [10],  Williams  [37],  Mawhin  [26.  27] ) • 
Brouwer’s  fixed  point  theorem,  particularly  C.  Miranda’s  equivalent  form 
(Cesari  [6],  Knobloch  [20]):  the  theory  of  monotone  and  maximal  monotone 
operators  in  Banach  spaces  (Gustafson  and  Sather  [11],  Cesari  and  Kannan 
[9]);  Schauder’s  principle  of  invariance  of  domain  (Kannan  [17]):  implicit 
function  theorem  and  Newton’s  "polygon  ” method  in  Banach  spaces  (Sather 
[M)l. 

For  self-adjoint  problems  we  have  presented  results  of  the  theory  in  a 
recent  paper  [8]  In  the  present  paper,  therefore,  we  shall  mention  recent 
aspects  of  the  theory,  mainly  for  non-self-adjoint  problems.  We  shall  present 

•this  research  was  partially  supported  by  AEOSR  Research  Project  71-2122  at  the 
I. Diversity  of  Michigan 
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first  the  modification  of  Cesari's  alternative  scheme  proposed  by  Hale, 
Bancroft,  and  Sweet  [15]  for  non-self-adjoint  problems. 


2.  An  Alternative  Scheme 

Let  us  consider  the  equation 

Ex  = Nx,  ( 1 ) 

where  E is  a linear  operator  whose  domain  (/(E)  is  some  subspace  of  a 
Banach  space  X and  whose  range  tf(E)  lies  in  another  Banach  space  K 
In  (1),  N denotes  an  operator,  not  necessarily  linear,  with  2(N)<^X, 
rf(N)  c X 2(E)  n 2(N)  * 0. 

We  simply  assume  here  that  there  are  projection  operators  P:  X ->  X, 
Q:  Y ->  Y and  a linear  operator  H\  2(H)  -*  X , 2(H)  <=  X,  such  that 

(k , ) H(l  - Q)Ex  = (/  - P)x  for  all  x e 2(E) , 

(k2)  QEx  — EPx  for  all  x e 2(E), 

(k3)  EH(I  - Q)Nx  = (/  - Q)Nx  for  all  x e 2(E)  n 2(N), 

and  where  we  assume  that  .^(P)  c 2(E),  Jf(H)  c 2(E),  (I  — Q).tf(E)  c 
2(H). 

Thus  P and  Q are  linear  bounded  idempotent  operators  (P2  = P.Q2  — Q)\ 
X0  = PX,X i = (I  — P)X  are  linear  subspaces  of  X ; X0  = QY.  Xj  = (/  — Q)Y 
are  linear  subspaces  of  X;  and  H : 2(H)  -*  X can  be  thought  of,  from 
(k,)  and  (k3),  as  a partial  inverse  of  E. 

Note  that  for  any  x such  that  Ex  = 0,  we  have  from  (k , ) that  (I  — P)x  = 0, 
or  Px  = x,  or  x e PX  — X0 . Thus,  the  null  space,  or  kernel,  of  E must  be 
contained  in  X0,  or  briefly,  ker  £cX0, 

It  is  apparent  from  (k13)  that  it  is  enough  to  define  H on  (/  — Q)Y  = X,, 
namely,  we  can  well  assume 

2(H)  a Y{  with  3t(H)  c 2(E)  n X 

Moreover,  if  we  restrict  H to  be  a partial  inverse  of  E,  then,  besides 
■#(H)  a 2(E),  we  need  also  2(H)  a .tf(E),  and  this  implies  that 
(/  - Q\'A(E)  cz  jf(E)  n X,.  However,  since  I — Q acts  as  the  identity  operator 
on  X,,  we  must  also  have  (I  — Q).tf(E)  =)  tf(E)  n X,.  Thus, 


2(H)  = (/  — Q)Jt(E)  = #(E)  n X,. 
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Finally,  from  (k3)  we  need  (/  — Q).tf(N)  a (/(H).  This  will  certainly  be  the 
case  if  rf(E)  n Yj  = Tj,  and  then 

(/(H)  = (l-  Q).Jt(E)  = .Jf(E)  n y,  = y,. 

Finally,  if  Q and  £ are  so  related  that  -#(E)  = Y{  = (I  — Q)Y.  then 
V'0  = Q F is  the  complementary  space  of  in  K and  T0  is  then  the 
cokernel  of  £.  or  coker  E = Y0  = QY.  Then  we  have  also  QEx  = 0,  EPx  = 0, 
for  all  .v  e (/(E),  and  X0  = ker  E. 

(2.i)  Theorem.  If  (k123)  are  satisfied,  then  Ex  = Nx  for  some 
x e (/(E)  n (/(N)  if  and  only  if 


x = Px  + H(I  ~ Q)Nx, 

(2) 

Q(Ex  - Nx)  = 0. 

(3) 

Proof.  First,  Ex  = Nx  implies  (I  — Q)(Ex  — Nx)  = 0 and 

Q(Ex  - Nx)  = 0. 

Since  Ex  = Nx  e Jt(E),  we  have  (/  — Q)Ex  = (I  — Q)Nx  e (/  — Q).rf(E)  = 
(/(H).  By  applying  H to  the  equation  (I  - Q)(Ex  — Nx)  = 0.  we  have 
H(l  — Q)Ex  = H(I  — Q)Nx,  and  by  (k,)  also  (I  — P)x  = H(1  — Q)Nx,  and 
finally  .x  = Px  + H(I  — Q)Nx,  and  system  (2),  (3)  is  satisfied.  Conversely, 
ifsystem  (2),  (3)  is  satisfied,  then  (/  - P)x  = H(I  - Q)Nx,(I  - Q)Nxe  (/(H), 
H(I  — Q)Nx  e Jf(H)  cz  (/(E),  thus  (I  — P)x  e (/(E),  and  by  applying  £ to 
this  equation,  also  £(/  — P)x  = EH(I  — Q)Nx.  By  (k2)  and  (k3)  we  have 
then  Ex  — QEx  = (/  — Q)Nx,  and  finally 

Ex  - Nx  = Q(Ex  - Nx)  = 0. 

Equations  (2)  and  (3)  can  also  be  written  in  the  form 


= x*  + H(I  - Q)Nx , 

(2') 

Q(Ex  - Nx)  = 0, 

(3') 

where  x*  = Px  is  an  element  of  X0  = PX.  As  we  shall  see.  for  a large 
class  of  problems  it  is  possible  to  choose  P.  Q,  H in  such  a way  that 
Eq.  (2')  is  uniquely  solvable  for  every  x*  e X0 . If  x = x(x*),  x*  e X0,  is  the 
unique  solution  of  (2');  then  (3  ) reduces  to  an  equation  Mx*  = 0 in  the 
unknown  x*  in  X0.  Often,  X0  is  a finite-dimensional  space,  and  thus  the 
original  problem  ( 1 ) is  reduced  to  an  alternative  problem  Mx*  = 0 in  a 
finite-dimensional  space  (alternative  methods).  Equation  (2  ) is  said  to 
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be  the  auxiliary  equation,  and  (3')  the  bifurcation,  or  determining  equation. 
Actually,  the  auxiliary  equation  (2  ) is  of  the  form  x — H(I  — Q)Nx  = x*, 
or  (/  + KN)x  = x*.  that  is,  a Hammerstein  equation. 


3.  Reduction  to  an  Alternative  Problem  by  Contraction  Maps 

We  shall  now  make  the  following  assumption: 

(k4)  There  are  positive,  not  decreasing  functions  x(p),  (i(p)  such  that 
for  all  x,  x'  with  j|x||,  ||x'||  < p we  have 

\\H(I -Q)Nx\\<P(p), 

|| H(I  - Q)Nx  - H(I  - Q)JVx'||  < a(p)||x  - x'||. 

Thus,  if  || H(l  - Q)||  < K,  and  |jNx  — Nx'\\  < L\\x  — x'||  for  ||x||,  ||x'||  < p. 

then  we  can  take  a (p)  — LK. 

Next,  for  any  two  positive  constants  c,  d,  we  consider  the  sets 
V(c)  = {x*eP(X)\\\x*\\<ci 
S(x*.  c,  d)  = jx  6 X | Px  = x*.  || xj|  < (/}, 

with  x*  6 V(c). 

(3.i)  Theorem.  Under  the  hypotheses  (k1234),  and  positive  constants 
c < d,  with  x(d)  < 1,  (1(d)  < d — c,  the  map  T:  X -*  X defined  by 
T — P + H(I  — Q)N  maps  S(x*.  c,  d)  into  itself  and  is  a contraction. 

Proof.  For  x.  x'  e S(x*.  c.  d),  we  have 

Px  = Px'  = X*.  ||x*||  < c, 

| Tx||  = ||  Px  + H(1  - 0)Nx||  < ||  x*  ||  + I H(l  - Q)Nx\\  <c+  (d  - c)  = d. 

|| Tx  - r.x'H  = \H(l  - Q)(Nx  - Nx')||  < x(df  x - x'||. 

Thus.  T is  a contraction  map. 

Under  the  above  hypotheses  and  for  any  x*  e F(c)  there  is,  therefore,  by 
Banach’s  theorem,  a unique  fixed  point  x = Tx  = F(x*)e  S(x*.  c,  d)  with 
Px  = PTx  ■=  x*.  which  satisfies  the  auxiliary  equation  (2').  Thus,  system  (2'). 
(3')  is  reduced  to  the  single  (bifurcation)  equation  Mx*  = 0,  or 

Q(E  — N)F(x*)  = 0.  x*  e V(c). 


(4) 
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Of  course,  in  applications  where  []  is.  for  instance,  a square  norm  or  an 
L,-norm.  it  may  occur  that  N does  not  satisfy  the  boundedness  and 
Lipschitz  requirement  (k4).  Indeed,  if  N is  the  Nemitsky  operator  correspond- 
ing to  some  real  function  f which  may  be  a polynomial,  then  condition 
(k4)  and  corresponding  Theorem  (3.i)  can  be  shown  to  hold  in  a local 
sense  by  a number  of  devices,  for  instance,  the  use  of  an  associated  second 
norm,  say  a sup  norm,  and  by  restricting  the  search  of  solutions  in  suitable 
domains  of  the  function  space  X . say  a ball  in  the  Sup  norm  (Cesari  [5.  6]). 
Other  devices  are  mentioned  in  [8]  (see  also  Section  11). 

Statement  (3.i)  has  been  repeatedly  used  in  solving  the  auxiliary  equation, 
where  d is  either  an  a priori  bound  for  the  solution,  or  a number  to  be 
determined  at  the  numerical  stage. 

That  P,  Q.  H can  be  chosen  in  such  a way  to  make  k = H(I  — Q)  as 
small  as  we  want,  and  then  a (p)  = Lk  < 1.  has  been  shown  by  Cesari 
([6];  see  also  [8])  for  self-adjoint  problems  (X  = Y = S a Hilbert  space), 
and  by  Hale  ([13];  see  Section  5)  also  for  non-self-adjoint  problems 
(X  Hilbert,  Y Banach). 

These  results  are  relevant  because  they  show  that  it  is  “in  general"  possible 
to  determine  P.  Q.  H.  and  thus  perform  the  splitting  of  Eq.  ( 1 ) into  auxiliary 
and  bifurcation  equations  (2)  and  (3).  in  such  a way  that  the  auxiliary  one 
can  be  solved  by  the  Banach  fixed  point  theorem,  and  thus  problem  (1) 
is  theoretically  reducible  to  the  sole  bifurcation  equation  (the  alternative 
problem,  often  in  a finite-dimensional  space).  An  analogous  remark  holds 
in  relation  to  monotone  operators  (see  the  end  of  Section  6). 

It  is  also  relevant  to  say  here  that  the  bifurcation  equation  can  often 
be  handled  by  topological  degree  and  Leray  Schauder  techniques  [23]. 


4.  Topological  Degree  Argument 

Let  us  replace  problem  ( 1 ) by.  say. 

Ex  = kNx,  0 < / < 1,  (5) 

and  thus  (2)  and  (3)  are  replaced  by 

.x  = Tx  = Px  + W(1  - Q)Nx,  (6) 

Q(Ex  - z. Nx ) = 0.  (7) 

We  assume  here  that  the  space  T0  = QY  is  finite-dimensional.  We  also 
assume  that  an  a priori  bound  for  the  solution  v of  problem  (5)  can  be 
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found  that  is  independent  of  A,  0 < A < 1.  Let  us  choose  P , Q , H in  such 
a way  that  T is  a contraction  map  in  the  entire  solid  ball  B with  center 
at  the  origin,  which  is  determined  by  the  a priori  bound.  If  x = x(x*,  a)  is  the 
solution  of  the  auxiliary  equation,  and  we  write  briefly  the  bifurcation 
equation  (7)  in  the  form  Sx  = S(x*,  A)  = 0,  then  S(x*,  A)  # 0 on  the  boundary 
CB*  of  the  projection  PB  — B*  of  B.  Thus,  the  topological  degree 
deg  (0;  Sx,  B*)  is  defined  and  does  not  depend  on  A,  0 < A < 1.  Actually, 
for  A = 0 the  problem  is  linear,  and  deg  (0;  S0,  B*)  may  be  easy  to 
determine.  If  the  latter  is  nonzero,  then  deg  (0,  St,  B*)  = deg  (0;  S0,  B*)  ^ 0, 
the  bifurcation  equation  is  solvable,  and  so  is  the  given  problem  (1). 

This  Leray  Schauder  type  argument,  in  connection  with  alternative 
methods,  has  been  consistently  used  by  Mawhin  (see,  e.g.,  [26]). 

In  connection  with  this  type  of  argument,  Williams  [37]  considered  the 
self-adjoint  case  X - Y,  P = Q.  S0  finite-dimensional,  T a contraction.  In 
this  situation  Williams  noted  that  the  fixed  points  z = Wz  of  the  map  W 
defined  by  Wz  = Pz  + H(l  - P)Nz  - P(E  - N)Fz  are  exactly  the  solu- 
tions of  the  original  problem  (1).  The  map  W may  not  be  compact,  but 
the  analogous  map  W',  defined  by  W'z  — WFz , is  compact,  so  that  a Leray- 
Schauder  degree  jls  for  the  map  / — W'  can  be  defined.  Williams  proved 
that  ijs  is  then  equal  to  the  Brouwer  degree  of  the  finite-dimensional  map 
S,  corresponding  to  the  bifurcation  equation.  In  the  same  direction,  Mawhin 
[27]  then  considered  the  non-self-adjoint  case  with  A"0  = PX  = ker  £, 
Y0  = QY  — coker  E , X0  finite  dimensional.  Then,  for  any  map  S:  Y0  -»  X0 
with  S 1 (0)  = 0,  the  fixed  points  z=Wsz  of  the  map  Ws,  defined  by 
Wsz  — Pz  - V H(I  - Q)Nz  + SQNz , are  the  solutions  of  (1).  Under  suitable 
hypotheses  of  boundedness  on  N this  map  Ws  is  compact,  and  a Leray- 
Schauder  degree  i'is  for  / — Ws  can  be  defined  (coincidence  degree). 


5.  Hale’s  Statement  Concerning  the  Norm  of  H 

(a)  Let  us  assume  that  E , P.  Q are  given  as  requested,  so  that 

.*(P)  e ^(£),  ker  E a X0  = PX, 

(l  - Q\#{£)  = AE)  n Vj,  QEx  = EPx. 

Let  E'  be  the  map  E restricted  to  D'  = (/(£  ) = X,  n (/(E),  and  let 
R'  = .^(£  ),  so  that  £':  D'  -*  R'  is  one  one  and  onto.  Then,  necessarily  £' 
has  an  inverse  //:£'->  D',  which  is  linear  and  onto.  As  a consequence  of 
the  closed-graph  theorem,  we  also  have 


NONLINEAR  OSCILLATIONS  IN  THE  ERAME  OF  ALTERNATIVE  METHODS  35 


(5.i)  If  the  linear  map  £ has  closed  graph  and  closed  range,  then  H is  a 
bounded  linear  map. 

Concerning  estimates  on  the  norm  ||//||  of  H , the  following  simple  remark 
is  relevant: 

(5.ii)  We  have  \\H\\  < k if  and  only  if  ||(/  - P)x  ||  < k||£x||  for  all  x e (/(E). 

Indeed,  for  y e R'  and  x e D'  with  y = E'x,  we  have  Hy  = HE'x  = 
(I  — P)x.  Then,  ||(/  - P)x\\  < kj|£x||  if  and  only  if  \\Hy\\  < k||y||.  Thus,  we 
need  not  compute  H to  estimate  its  norm  ||//||.  The  latter  can  be  computed 
directly  from  £ and  P. 

(b)  Let  X and  Y be  Banach  spaces,  £:  (/(E)  -*  Y a linear  operator  from 
a linear  subspace  (/(E)  of  X with  closed  graph  and  closed  range,  ard  f/(E) 
everywhere  dense  in  X.  Let  us  make  the  assumption  that  there  are  projector 
operators  P0 : X -*  X,  Q0 : Y ->  Y such  that  ker  £ = P0  A"  = X0,  coker  £ = 
Q0  Y = T0,  and  let  X = + A",,  Y = T0  + T,  (direct  sums),  Xt  = 

(I  - P0)X 0,  Yi  = (/  - Q0)Y0 . Let  H0 : ^ ->  Xx  n®(£)  denote  the  inverse 
operator  of  £':  A-,  n .(/'(£)-»  Tj.  Thus  H0  is  a linear  bounded  operator. 

Then,  for  every  x e (/(E)  we  certainly  have  Q0Ex  = 0,  EP0x  = 0, 
£(/  — P0)  = £,  H0Ex  = H0E(I  — P 0)x  — (I  — P0)x,  and  EH0y  = y for  all 
ye  T,,  that  is,  (k123)  hold.  Consequently,  as  in  (1),  we  can  derive  from 
Ex  = Nx  Eqs.  (2)  and  (3),  which  now  have  the  simpler  form 

x = P0x  + H0(I  - Q0)Nx,  Q0  Nx  = 0.  (8) 

If  S:  X -»  X is  any  projector  operator  with  SX  c X,  n (/(£),  SP0  = 0, 
then  we  also  have  P0  S = 0,  and  P = P0  + 5:  X -*  X is  a projector  operator. 
Moreover,  SP0  = 0,  (I  — P)P0  = 0,  and  by  applying  I — P and  S to  the 
first  of  relations  (8),  then  the  same  relations  (8)  become 

x = Px  + (I  - P)H0(I  - Q0)Nx,  (9) 

Sx  = SH0(1  - Q0)Nx , Q0  Nx  = 0.  ( 10) 

Relation  (9)  is  the  auxiliary  equation ; relations  ( 10)  together  are  the  bifurca- 
tion equation.  It  is  clear  that,  under  the  same  assumptions  as  above,  relations 
(9)  and  (10)  imply  Ex  = Nx.  Thus,  Ex  = Nx  and  system  (9),  (10)  are 
equivalent. 

(5.iii)  (Hale  [13]).  Under  the  hypotheses  above  with  X a Hilbert  space 
and  H0  compact,  given  k > 0,  it  is  possible  to  choose  a projector  operator 
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S:  X -*  X,  || 5 1|  = 1,  with  finite-dimensional  range,  such  that 

SX  c X,  n (/(£), 

SP0  = 0,  and  if  P = P0  + S,  then  ||(/  - /J)H0||  < k. 

Thus,  if  N is  Lipschitzian  of  constant  L in  the  hall  in  X with  center  at 
the  origin  and  radius  p,  then  for  all  x„  x2  e X,  ||x,||,  ||x2||  < p,  we  have 

ll(/  ~ P)H0(I  ~ Qo)(N*i  ~ Nx2)||  < ft||/  ~ Coll^lx,  - x2||, 

and  a(p)  = fc||/  — Q0||L  can  be  made  as  small  as  we  want.  If 
||(/  ~ Qo)Nx\\  < M for  all  j|x||  < p,  then  ||(/  - P)H0Nx ||  < kM  = /f(p). 
Consequently,  for  0 < c < d < p,  a(p)  < 1,  /f(p)  < d - c.  and  for  every 
x*ePX,  ||x*||  < c,  the  mapping  T defined  by 

Tx  = x*  + (/  - P)  H0(\  - Q0)Nx 

is  a contraction  on  the  subset  [xeX,  Px  = x*,  ||x||  < (/]  of  X.  Thus 
Eq.  (1)  is  reducible  to  an  alternative  problem.  For  a proof  of  ( 5 . i i i ),  see 
Hale  [13,  p.  13]. 


6.  Reduction  to  an  Alternative  Problem 
by  Monotone  Operators 

If  Y = X*  is  the  dual  space  of  the  Banach  space  A",  and  we  denote  by 
<x,  y)  the  pairing  of  x and  y,  or  the  linear  operation  y(x),  x e X,  ye  X *, 
then  a linear  operator  T:  X -*  X*  is  said  to  be  positive  provided 
<x,  Tx)  > 0 for  all  x e X.  Let  Y ~ X*  and  let  us  use  the  same  notation 
as  in  (5.ii). 

(6.i)  We  have  <x,  —Ex)  >0  for  all  xe  D'  if  and  only  if  < — Hy,  y)  > 0 
for  all  ye  R'.  If  <x,  -Ex)  > p||x||2  for  some  p > 0,  then 

<-Hy,y)>n\\Hy\\2.  (II) 

Indeed,  as  in  (5.ii),  we  have  ( — Hy,  y)  = <x,  -Ex).  Thus,  again,  we 
need  not  compute  //  to  verify  its  positive  character,  which  instead  can  be 
derived  from  E. 

In  our  previous  paper  [8]  we  have  shown  that,  for  self-adjoint  problems 
(in  particular,  X = Y = S,  a Hilbert  space),  it  is  always  possible  to  choose 
P,  Q,  H,  in  such  a way  that  (11)  is  satisfied  [X,  formula  (X)].  Also,  we 
have  shown  in  [X]  the  relevance  of  property  (11)  in  proving  the  solvability 
of  both  the  auxiliary  and  bifurcation  equations,  and  therefore  of  the  given 
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problem  (1),  in  self-adjoint  problems  (with  X = Y = S,  a Hilbert  space), 
under  monotonicity  hypotheses,  or  analogous  ones,  on  N.  (See  Cesari  and 
Kannan  [9]  and  also  [ft],  where  full  proofs  are  given  based  on  Brezis, 
Crandall,  and  Pazy’s  theory  of  monotone  and  maximal  monotone 
operators.) 

As  we  shall  see  in  Section  7 relation  (II)  holds  also  for  non-self-adjoint 
elliptic  differential  operators  E,  ordinary  and  partial,  under  very  mild  assump- 
tions and  when  suitable  operators  P,  Q , H are  chosen,  as  Osborn  and 
Sather  have  proved  [32].  Here  we  give  a statement  that  is  analogous  to 
another  one  recently  proved  by  Osborn  and  Sather  [32]  concerning  the 
solvability  of  the  auxiliary  equation  under  monotonicity  conditions  on  N. 
This  statement  (6.ii)  is  an  extension  to  non-self-adjoint  problems  of  Cesari 
and  Kannan’s  theorem  (ft.i)  of  [ft]. 

To  this  effect  let  us  assume  that  the  operators  P,  Q,  H have  been  chosen 
so  that  (k123)  hold  and  that  there  exists  some  constant  /i  > 0 such  that  the 
following  strong  positivity  property  holds: 

<x,  — Ex)  > //||x||2  for  x e D n X ,,  Xl=(l  — P)X, 
or  (12) 

<-//>',  y)  > n\\Hy\\2  for  yeVj,  = (I  - Q)Y. 

Concerning  N:  's(N)-*Y,  X = y(N),  we  shall  assume  that  N is 
hermcontinuous,  and  that  N is  monotone,  that  is, 

<Nxi  — Nx2,  Xj  — x2>  > 0 for  all  x,,  x2  e X.  (13) 

Also  we  shall  assume  that  P and  Q have  been  so  chosen  that 

<x,  y>  = 0 for  all  x e X,  = (/  - P)X  and  yeY0  = QY.  (14) 

(6.ii)  Under  hypotheses  (k123)  and  (12)  (14),  with  A"  reflexive  and  Y = X*, 
the  auxiliary  equation  (2  ) has  a unique  solution 

.x  = E(x*),  x 6 x*  + (X , n D)  for  every  x*  e X0. 

Proof.  First  we  note  that  for  the  linear  bounded  operator  -//(/  — Q)\ 
Y -*  X , n I)  we  have 

< - H{I  - Q)y , y>  = <- //(/  - Q)y,  (/  - Q)y ) + < - H(1  - Q)y , Qy} 

zp\\-H(1 -Q)y\\2  (15) 

forallye  Y,  because  of  hypotheses  (12)  and  (14).  Actually,  — H is  a one  one 
map  from  Vj  to  A",  n D.  If  for  any  x e X t n D we  write  x = — Hy,  ye  Vj, 
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then  -H(l  - Q)(y  + Y0)  = x and  [-//(/  - Q )]  lx  = y + Y0.  Thus, 
[-//(/  - 0)]  1 is  a set-valued  map  defined  for  each  x e X , r.  I)  with  values 
y + Y0.  Also,  we  see  that  the  auxiliary  equation  (2  ) or 

x-  x*  - H(1  -Q)Nx  = 0 (16) 

implies 

[~H(I  - Q)]~l{x  - x*)  + NxbO,  x = x*  + (XlnD).  (17) 

Conversely,  if  (17)  holds,  then  x — x*  — //(/  — Q)x  is  in  X () , and  since 
x - x*  e Xt  n D,  and  X0nX,  = 0,  we  derive  x - x*  - //(/  - Q)Nx  = 0. 
Thus,  Eqs.  (16)  and  (17)  are  equivalent. 

Equation  (17)  is  of  the  form  (/I  + B)x9  0,  where  A:  x*  + (A',n/)), 
B:  X -»  X are  the  operators  defined  by  Ax  = [ — //(/  - £)]  1 (x  - x*)  for 
X 6 X*  + (X  j n D),  and  by  Bx  = Nx  for  x e X,  and  A is  a set-valued  map. 
Since  — H(l  — Q ):  Y -»  Xx  n D c A"  is  monotone  and  continuous,  then 
-//(/  - 0)  is  maximal  monotone.  By  hypothesis,  N is  monotone  and  hemi- 
continuous.  Thus,  B = /V  is  also  maximal  monotone.  Finally,  V(/l)  = 
X t n I)  a X - V(B),  where  A"  is  the  whole  space.  Then,  A + B is  maximal 
monotone  (see,  e.g.,  Brezis,  Crandall,  and  Pa/.y  [4]  for  a recent  proof ).  Let 
us  prove  that  A is  coercive.  Indeed,  for  y e Ax  =[  — //(/  — (7)]  'x,  we 
have  x = -//(/  - Q)y , ||x||  = ||  -//(/  - Q)y||,  and  by  (15)  also 

|| x ||  '<x,  y)  = || x || " '<  — //(/  - Q)y,  y > > ||x||“  VII  -//(/  - C)>’l|2  = ^!|x||, 

and  thus  ||x||  '<x.  Ax)  ->  Too  as  ||x||  -*  +oo.  Since  /I  is  coercive,  so  is 
A + B We  conclude  that  .^(/l  + B)  = E Hence,  the  equation  (/I  + «)x  a 0 
is  solvable,  that  is,  (17)  is  solvable,  and  (16)  is  solvable. 

Let  us  prove  that,  for  every  x*  e X0,  (16)  has  a unique  solution.  Indeed, 
if  x , , x2  were  two  solutions,  then 

x,  - x*  = mi  - Q)(NXl ),  x j - X*  = //(/  - Q)(Nx2) 

and 

0 < (Nxi  — Nx2  , x,  — x2> 

= -<Nx,  - Nx2,  -//(/  - (?)Nx,  + //(/  - C)/Vx2> 

< — /i||  —H(l  - Q)Nxi  + 11(1  - Q)Nx2\\2  = -/i||x,  - x2  2. 

where  /t  > 0 and  we  have  used  property  (12)  of  the  operator  //  finis, 
x,  = x2.  We  have  proved  that  the  auxiliary  equation  (2  ).  or  (16).  has  a 
unique  solution  x € x*  -f  (X , n D)  for  every  x*  e X(). 
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It  will  he  shown  elsewhere  that  the  bifurcation  equation  can  also  be 
discussed  under  monotonicity  hypotheses,  or  analogous  ones,  on  N.  thereby 
extending  Cesari  and  Kannan's  results  in  [X,  Section  X], 

7.  Considerations  on  the  Positivity  Property 

Let  X = 5,,  Y - S be  real  Hilbert  spaces,  X c Y,  with  inner  products 
( , ),  and  ( , ),  and  norms  |]  ||,  and  !|  ||,  respectively.  Let  us  assume 
that  X is  everywhere  dense  in  V and  that  the  identity  map  /:  X -*>  Y is 
compact.  Let  £:  D -*  Y be  a linear  operator,  with  domain  I)  = '/(E)  c X , 
D dense  in  X.  with  compact  resolvent  £;(£),  satisfying 

(A)  (x,  -£x)>a||x||?  -b\\x\\2  (IX) 

for  all  x c D,  and  some  constants  a > 0,  h real. 

Thus,  we  assume  that  — E has  a positivity  property  similar  to  the  Garding 
inequality  for  the  Dirichlet  problem,  and  we  know  that  uniformly  elliptic 
differential  operators  (ordinary  and  partial)  with  suitable  boundary  condi- 
tions possess  this  property  (sec,  e.g.,  Agmon  [1,  2|).  Moreover,  under 
hypotheses  of  denseness  of  the  linear  combinations  of  the  generalized  real 
and  complex  eigenfunctions  of  £*,  the  dual  of  £,  the  following  has  been 
proved  (Osborn  and  Sather  [31,  32J):  It  is  possible  to  extend  the  spaces 
X and  Y so  that  relation  (IX)  still  holds  in  the  new  norms  and,  on  the 
other  hand,  given  N > 0 there  is  a projection  operator  P such  that 
.*  i > N |:x |;  for  all  .xeA’,  = (/  — P) X.  Thus,  given  /;>(),  if  we  take 
N2  = b/t;  we  also  have  <x,  - Ex')  > (a  — e)||x||f  for  all  x e X t.  Also,  given 
any  //  > 0 if  we  take  N2  = a '(/<  -f  />),  we  have  <x,  —Ex)  > n |.v||2. 

Remark  1.  Applications  of  the  considerations  of  Sections  6 and  7 to 
global  existence  of  solutions  to  boundary-value  problems  for  elliptic 
differential  equations,  ordinary  and  partial,  will  be  discussed  elsewhere.  In 
Sections  lOand  1 1 we  consider  boundary-value  problems  for  similar  ordinary 
and  partial  differential  equations  from  the  local  viewpoint  and  the  use  of 
the  considerations  of  Section  3. 

Remark  2.  For  strong  nonlinearities  the  requirement  that  N be  defined 
in  the  whole  space  X may  hi  too  restrictive.  For  self-adjoint  (sec  [X], 
Section  13)  as  well  as  for  non-self-adjoint  problems,  a number  of  devices 
have  been  proposed,  namely,  suitable  decompositions  of  the  relevant 
operators  in  products  of  operators  (Gustafson  and  Sather  [11],  Osborn 
and  Sather  [32],  Kannan  and  Locker  [ IX,  19]). 
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8.  A Strongly  Nonlinear  Oscillation  Problem 

The  problem  of  27t-periodic  solutions  of  the  differential  equation 

x"  + x3  = sin  t,  (19) 

has  been  a topic  of  recent  interest  of  several  authors.  Equation  (19)  was 
initially  studied  by  Cesari  [5]  by  alternative  methods,  namely,  by  the  con- 
siderations of  Section  3 (for  self-adjoint  problems)  using  L2  and  Sup  norms, 
and  the  existence  of  a 27r-periodic  solution  was  thereby  proved. 


9.  Forced  Oscillations  in  Lienard  Systems 

Let  us  consider  the  problem  of  existence  of  27t-periodic  solutions  for  the 
nonlinear  differential  system 

x"  -I-  (<//r/f)[grad  G(x)]  + Ax  = p(t).  — x < t + x, 

or  (20) 

n 

X,"  + (d/dt){cG/rXi)  + £ atj Xj  = p,(r).  i = 1 n. 

j=  i 

where  A is  a constant  nonsingular  n x n matrix.  p(t)  is  continuous  and 
2^-pcriodic  with  mean  value  zero,  and  G is  a given  function  from  R"  into 
R of  class  C2. 

Some  of  the  results  are 

(a)  If  A is  negative  semidefinite.  then  the  problem  always  has  a solution. 

(b)  If  A <1.  then  the  problem  has  at  least  one  solution. 

The  proof  is  based  on  the  general  process  of  Section  6 with  the  selection 
of  suitable  Sobolev  spaces  //s[().  27i]  and  H s[0. 2rr]  for  A'  and  V!  and  by  taking 

Ex  = x"  and  Nx  = -{d  )[grad  G(x)]  - Ax  -I-  />(/). 

Under  hypothesis  (a)  or  (b).  the  bifurcation  equation  can  be  chosen  to  be 
one  dimensional  and  it  is  trivially  solvable.  The  auxiliary  equation  is 
solvable  by  Amann's  theorem  [3]  on  the  Hammerstein  equation. 

Results  (a)  and  (b)  coincide  with  those  obtained  by  Mawhin  [28].  and  by 
Kannan  and  Locker  [19],  also  by  alternative  metnods. 

For  A > 1 some  more  information  on  G is  needed  but  the  more  general 
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system  can  he  taken  into  consideration: 

x"  + (<//c/r)(grad  G(x(t)))  + {d/dt)V(x(t),  I ) + Ax(t)  = p(t ),  (20  ) 

where  G(x)  is  a homogeneous  function  from  R"  into  R of  degree  2 p.  of 
constant  sign,  class  C'2.  satisfying  |G(x)|  > c|x|2p  for  some  p > 1,  constant 

c > 0.  Here  F(x.  /)  = (F, FJ  is  continuous  from  R" ' 1 into  R"  together 

with  the  partial  derivatives  r Vjct,  rVJcxj , and  27t-periodic  in  t.  A is  any 
n x » constant  matrix,  and  p(t)  is  continuous  from  R'  into  Rn  and  27t-periodic. 
Then,  system  (20')  has  at  least  one  27t-periodic  solution  if  we  know  that  either 

(c)  p > 2.  | F(x.  f)|  < Cxp  + l)  for  some  C.  D >0;  or 

(d)  p — I.  | K(x.  t)|  < CT | x |p  + D.  with  2c  > C,  and  A symmetric:  or 

(e)  V depends  on  x only.  V(x)  — grad  ^(x).  W of  class  C2.  satisfying 

| K(x)|  < C|x|2"  2 + D.  C.l)>  0.  p>  1. 
and  A is  symmetric  if  p = I. 

Finally,  we  may  consider  the  system 

x"  = (d  dl )(grad  G(x(t)))  + (d/dt)V(x(t),  t)  + Ax(l)  + c/(x(t))  = p(t).  (20") 

where  (/.  V.  A.  p are  as  before,  y:  R"  —>  R"  is  any  continuous  function  with 
t/(x)  | v | -*()  as  | x | ->  x.  and  A is  nonsingular.  The  same  statements  (c). 
(d).  (e)  hold. 

These  and  other  results,  which  extend  previous  ones  by  Mawhin  [28.  29], 
have  been  proved  by  Cesari  and  Kannan  [39.  40]  by  alternative  methods, 
a priori  estimates,  and  the  Borsuk  Ulam  theorem. 

10.  Nonlinear  Non-Self-Adjoint  Problems  for  Ordinary 
Differential  Equations.  Local  Analysis 

We  consider  here,  using  the  work  of  Nagle  [30].  boundary-value  problems 
for  systems  of  n first-order  ordinary  differential  equations  with  homogeneous 
linear  boundary  conditions,  or 

x'  — A(t)x  —fU-  x.  x ).  a < t < b,  x'  = dx/dt , 

Blx(a)  + B2  x(b)  = 0, 

where  v(/ ) = col  (x, xn)  are  the  //  unknowns.  A(t)  = [u,;(/)]  an  n x n 

matrix  with  real  bounded  measurable  entries,  and  B2  constant  n x n 
matrices.  Thus,  in  (21)  the  underlying  linear  problem  is 

x - A(t)x  = 0.  a < t <h,  Bjx(a)  + B2  x{h)  = 0.  (22) 
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or  Ex  = 0,  including  the  boundary  conditions  in  the  operator  E defined  by 
Ex  = x'  — A(i)x.  As  usual,  a solution  x(t)  = col  (x,,  . . x„),  a < I < b,  is  an 
n x I vector  with  absolutely  continuous  (AC)  entries  in  [a,  h],  satisfying  the 
differential  system  a.e.  in  [u,  b\.  The  case  of  periodic  solutions  of  period  T 
is  included  in  (21)  for  a = 0.  b = 7,  B,  = /,  B2  = -/,  / the  identity  matrix 
in  E". 

Let  | | denote  the  Euclidean  norm  in  En.  In  (2 1 ),  /(/,  x,  x')  = col  (/, /„) 

is  an//  x 1 vector  function  defined  on  [a,  /?]  x £2",  whose  entries  are  measur- 
able in  t for  every  (x,  x)  and  continuous  in  (x,  x')  for  every  t.  Moreover, 
we  assume  here  that  for  any  given  pair  of  constants  R ,,  R2  there  are  two 
other  constants  L,  M such  that 

| /(/,  x,  x')|  < M,  |/(f,  x,  x')  - f(t,  y,  y')|  < L[ | x - >-|  + \x'  - y'\] 

for  all  a < t < b,  |x|,  |y|  < K,,  |x'|,  | >■' | < R2  If  N denotes  the  operator 
defined  in  the  second  member  of  (21),  then  problem  (21)  takes  the  usual 
form  Ex  = Nx. 

For  any  //-vector  z(l)  = (r, r„),  a < l < b.  we  denote  by  ||zj|0  the  usual 

Sup  norm,  or 

Mlo  = Sup  |z(f)|, 

u<l  <b 

and  by  |jz||,  the  /M-norm 

Iklli  = (/?  - “)  1 1 I ’(ol  dt. 

* a 

Let  X denote  the  (Sobolev)  space  of  all  AC  vector  functions  x(r)  = 
(Xj,  . x„),  a < t < b , for  which  we  may  well  take  the  norm 

INI',  = ||  -x:  ||  o + Kill 

(since  this  norm  is  equivalent  to  the  Sobolev  norm  ||x||,  + ||x'||,).  Let  Y 
be  the  space  of  all  vector  functions  _>•(/)  = (vt  •••,  ,V„)  with  L,-integrable 
entries,  with  norm  |ly 

Let  A denote  the  transpose  of  any  given  matrix  A.  Then  the  linear 
problem  adjoint  to  (15)  is 

dy/dt  + yA(t ) =•  0.  y(a)  = aB„  y(b)  = -yli2,  (23) 

or 

dy/dt  + A(t)\  — 0,  y(a)  — B,a.  y(b)  = - B2,x.  (23  ) 

where  y(t)  - col  (y, y„),  or  y(t)  = row  (y,,  yj,  and  where  y = 

col  (y, yn)  denotes  any  arbitrary  real  //  x I vector  (parametric  form  of 
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the  adjoint  problem).  Then,  the  linear  operator  £*  adjoint  to  £ is  defined 
by  relation  (23)  or  (23  ).  and  as  usual  ker  £ = coker  £*.  coker  £ = ker  £*. 
We  denote  by  p and  q the  dimensions  of  ker  £ and  coker  £.  respectively, 
bredholm’s  alternative  theorem  now  has  the  usual  form.  Moreover,  projec- 
tion operators  P0:  X -»  X . Q,j.  Y -*  Y.  mapping  X onto  X 0 = P0(X ) = ker  £. 
and  Y onto  Y0  = Q(  Y)  = coker  £'.  can  be  defined  here  by  simple  algebraic 
operations  (see  Hale  [14.  p.  26.3],  Nagle  [30]). 

Then  problem  (1),  or  Ex  — Nx,  can  be  reduced  to  a system  (2).  (3)  of 
auxiliary  and  bifurcation  equations.  By  an  analysis  too  detailed  to  be 
reported  here  concerning  the  interplay  of  £,  and  Sup  norms,  the  auxiliary 
equation  can  be  handled  as  a Lipschitz  map.  Thus  the  same  equation  is 
solvable  if  the  Lipschitz  constant  is  <1.  For  / replaced  by  i.q  in  (21). 
where  i.  is  a “small"  parameter,  this  Lipschitz  constant  can  be  made  < 1 
by  taking  \i. | sufficiently  small.  Thus,  we  are  reduced  to  the  bifurcation 
equation,  actually  a system  of  q equations  in  p unknowns  (see  [30]).  For 
q ^ p and  q smooth,  it  is  enough  to  verify  that  the  relevant  q x p Jacobian 
matrix  for  i:  = 0,  which  is  easy  to  compute,  has  maximum  rank  q.  Then  by 
the  implicit  function  theorem,  problem  (21 ) has  at  least  one  solution. 

Theorems  are  proved  in  [30]  that  extend  to  these  perturbation  boundary- 
value  problems,  results  proved  by  Hale  [14],  Cesari  [7],  and  Mawhin  [25] 
lor  periodic  solutions  only,  smooth  solutions,  and  / or  q independent  of 
derivatives  of  maximum  order.  The  follow  ing  examples  concerning  periodic- 
solutions  may  be  of  interest  (for  details  see  [30]). 

for  periodic  solutions,  as  well  as  for  other  particular  situations,  the 
bifurcation  equations  and  their  Jacobian  matrix  at  ;;  = 0 have  been  written 
explicitly. 

Example  1 

x"  + n1  x = i;(l  — x2  )x'  + rxiw  ' x"  + i.boj( cos  <»l  + a) 

for  »,  > 0 a small  parameter,  n.  a.  h.  7 constants.  <n  = 7 2n.  has  periodic 
solutions  of  the  form 

x(r.  i.)  = 1 sin  [a»(f)  + <Hi:)]  + (Mi  ). 

Me)  - /0  + (Mi  ).  (Hi.)  = 0O  + 0(4 

where  0O  satisfy  the  equations 

/. ' - 4 pa) 1 cos  (a  — 0)  = 0,  a/.  + pw  sin  (7  — 0)  = 0. 

In  this  example  p = q = 2. 
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Example  2 

x"  = e[  — ex  + x2  sin2  f] 

for  e > 0 a small  parameter,  has  27r-periodic  solutions  of  the  form  x(f)  = 
x + 0(c)  with  a = +21  2.  In  this  example  p = q ~ 1. 

Hale's  concept  [14,  p.  267]  of  symmetric  systems  is  extended  in  [30]  to 
boundary-value  problems  (21)  of  the  form 

x'(t)-  A(t)x=f{t,x).  —a<t<a. 

Bjx(  — «)  -I-  B2  x{a)  = 0. 

We  say  that  (24)  has  a symmetry  property  S if  there  is  an  n x n constant 
matrix  S such  that  (a)  S2  = I ; (b)  Sf(  — t , Sx)  = —f(t,  x);  (c)  if  r(f). 
-a<t<a.  satisfies  B1:(  — a)  + B2z{ci)  = 0.  so  does  Sz(  — f);  (d)  if  :(t). 
-a  < r < a.  satisfies  the  adjoint  boundary  condition  z(  — a ) = y.Bx,  z(a)  = 
-5B2,  so  does  Sz(-t). 

Under  these  hypotheses,  the  q components  of  the  bifurcation  equation 
can  be  proved  to  be  linearly  dependent  [30],  and  thus  we  may  expect  a 
family  of  solutions  to  problem  (24)  depending  on  a suitable  number  of 
parameters. 

The  following  two  examples  from  [30]  containing  a small  parameter  c 
both  possess  a family  of  solutions  satisfying  the  given  boundary  conditions: 

Example  1 

Xi  = x2  + x),  ./,(-?-  *i,  ~x2.x3)  = - J\(t , x), 

X2  = -Xi  + e/2( f,  a),  f2{-U  Xj,  -x2,  x3)  =/2(t,  x). 

x3'  = e/3(fi  x).  ,/3(  - (.  Xj,  -x2 . x3)  = -/3(f,  x) 

X j ( 7l)  X , ( 7T ) = 0,  x2(-7l)  + X2(ti)  = 0,  X3(  - Jt)  - X3(lt)  = 0, 

S = (1.  0.  0;  0.  - 1,  0:  0.  0,  1).  x = (x,.  x2.  x3). 

The  system  has  a two-parameter  family  of  solutions.  xt,  x3  even.  x2  odd, 
of  the  form 

Xj  = A cos  t + 0(g).  x2  = -A  sin  f + 0(e).  x3  = c + 0(e). 

A,  c arbitrary,  |e|  sufficiently  small. 

Example  2 

xi'  = x2  + efiit.  x).  x2'  = x3  + e/2(f,  x).  x3'  = e/3(f,  x). 

where  —a<t<a.  Xj(  — a)  — x,(u)  = 0,  x2(  — a)  + x2(a)  = 0.  x3(—a) 
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— x3(a)  = 0,  /,,  J2,  J3,  and  S as  in  Example  1.  The  system  has  a two- 
parameter  family  of  solutions,  xx,  x3  even,  x2  odd,  of  the  form 

.v,  = x,  + x2 12  + 0(f:).  x2  = 2x2  t + 0(e),  x3  = 2x2  + 0(e), 

x,.  x2  arbitrary.  |e|  sufficiently  small. 

11.  Nonlinear,  Non-Self-Adjoint  Elliptic  Problems  of  Order  m 
with  Nontrivial  Kernel — Local  Analysis 

Let  us  consider  with  Shaw  [36]  the  general  problem 

Ex  = fit,  x,  Dx Dmx),  feCc  £', 

(25) 

Bx  = 0.  tecG.  r = (t, t,), 

where  £ is  a linear  elliptic  operator  of  order  m in  a region  G of  the 
v-dimensional  space  £',  with  linear  homogeneous  boundary  conditions  on 
fG.  Of  particular  interest  is  the  case  where  there  are  nonzero  functions 
x0(t).  t € G.  with  £.v0  = 0.  Bx0  = 0.  that  is.  the  fundamental  operator  pair 
(£.  B)  has  nontrivial  null  space.  A particular  case  is.  of  course,  the  Neumann 
problem  with  £ = A and  boundary  condition  dx/dn  = 0 on  cG.  whose  null 
space  is  made  up  of  all  constant  functions  on  G.  In  (25)  the  operator  £ 
has  the  form 

£.y  = X O’  = dW/dt\'  ■ ■ ■ ct 

|i|<m 

with  smooth  real  coefficients  satisfying  a uniformly  elliptic  condition 

c“  1 Km|  < I aM**c\tr 
1*1  = m 

for  some  constant  c > 0 and  all  c = (^l5  — cv)  real. 

Here  G is  a bounded,  connected  open  subset  of  £'  with  piecewise  smooth 

boundary.  In  (25). /(r.  x.  Dx Dm.\)  denotes  a smooth  real-valued  function 

defined  on  (G  u cG)  x RN  + V,  where  N = 1 + v + v(v  — l)/2  + is  the 

number  of  all  possible  distinct  partial  derivatives  of  orders  <m  in  R'.  We 
allow  in  /derivatives  of  order  <m.  that  is,  up  to  and  including  the  order 
m of  the  operator  £. 

Thus,  including  the  boundary  conditions  Bx  = 0 in  the  definition  of  the 
operator  £.  and  denoting  by  N the  operator  defined  by  the  second  member 
of  (25),  problem  (25)  takes  the  usual  form  Ex  — Nx. 

The  main  assumption  is  that  the  pair  (£,  B)  of  linear  operators  is  coercive 
in  the  sense  of  Agmon.  Doughs,  Nirenberg.  a condition  algebraic  in  nature. 
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Then,  (E,  B)  has  important  properties  as  a pair  of  differential  operators. 

Let  us  consider  the  Sobolev  spaces  X = Hm  + r(G)  and  Y = Hr(G)  for  r 
integer,  with  usual  norms  ||  ||m  + r,  ||  jjr.  Let  D be  the  subspace  of  X of  all 
x e X satisfying  Bx  = 0.  Then.  E:  D -»  Y is  defined  on  D , and  a con- 
sequence of  the  coercivity  assumption  is  that  for  every  x e D we  have 

||x||m+r  < C(||£.\'||r  + ||x||°) 

for  some  constant  C.  Another  consequence  of  the  coercivity  is  that  both 
X0  = kerEand  Y0  = coker  E have  finite  dimensions,  say  p and  q,  respectively, 
and  there  exist  projection  operators  P:  X ->  X,  Q:  Y -»  Y,  with  P(X)  = X0, 
(3(F)  = Y0  (see  [36]  for  proofs,  details,  and  statements).  A consequence  of 
the  assumption  r > v/2  is  that,  for  xeHr  + m,  by  Sobolev’s  imbedding 
theorems,  x and  all  derivatives  D*x,  0 < |x|  < m,  appearing  in  the  function 
/ are  continuous  in  G u cG.  and  moreover  their  Sup  norms  in  G u cG  are 
not  larger  than  CJ||x|jr  + m,  0 < |a|  < m,  for  suitable  constants  C«.  Thus,  the 
arguments  in  / remain  bounded  in  the  Sup  norm  if  ||x!|r  + m is  bounded. 
Since/was  assumed  to  be  smooth,  then  / is  Lipschitzian  in  its  own  arguments 

in  each  fixed  ball.  It  can  be  proved  that/(f,  x Dmx)  is  a Lipschitz  function 

of  x in  the  topology  of  Hr(G)  for  ||x||r  + ,n  below  any  given  constant  M. 

By  replacing /in  (25)  by  e.g,  where  c is  a small  parameter,  the  auxiliary 
equation  is  solvable  by  contraction  maps  and  the  Banach  fixed  point  theorem, 
at  least  for  j e | sufficiently  small.  It  turns  out  that  the  present  considerations, 
based  on  Sobolev's  imbedding  theorems,  allow  remarkably  good  estimates 
of  e.  For  p = q = 1,  the  bifurcation  equation  is  a real  equation  in  one 
real  unknown.  To  exhibit  the  power  of  the  present  approach,  we  present 
here  from  [36]  a few  examples,  with  p = q = 1,  e = 1,  one  with  nonlinearity 
including  derivatives  of  maximal  order. 


Example  1 

Ax  -I-  2x  = k — arc  tan  x on  G — [(c,  q).  0 < c,  q < 7r]. 
x = 0 on  rG. 

Here  /.  = 2 is  an  eigenvalue  of  the  operator  A on  G with  Dirichlet  boundary 
conditions  and  eigenfunction  sin  c sin  q.  (c,  q)  e G.  It  was  shown  in  [36]  that 
the  problem  is  solvable  if  and  only  if  \k\  < n 2.  Moreover,  for  \k  \ < n 2. 
the  solution  is  then  of  the  form 

x(c.  q)  = v d(c.  q)  + d sin  £ sin  q. 
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with 

) L'd(£)  f])  sin  £ sin  rj  dtj  — 0,  ||rd!|/.2  < 7t2/3, 

* G 

and  a suitable  value  of  the  constant  d. 

Example  2 

Ax  = sin  2nq  + 2~  1 V 3[x2(l  + x2T  ’]  + cos  (c,  '/)  e G- 

dx/dn  — 0 on  cG.  G = [(£,  t])\0  < £,  t]  < 1]. 

The  problem  has  a solution  of  the  form  x(£,  rj)  = d + v(q,  t]).  d a suitable 
constant,  and  jG  v (q,  rj)  dc  dr]  = 0. 

Example  3 

Ax  = /(£,  rj)  ± 10"  ‘x3  on  G = [(£,  rj)\0  < £,  t]  < 1], 
dx/cn  = 0 on  cG. 

where  / is  a given  measurable  bounded  function  on  G.  The  problem  has  a 
solution  for  |/|  < 1/20. 


12.  The  Theorem  of  Landesman  and  Lazer 


We  consider  here  the  problem  at  resonance 

Ex  + ax  + y(x)  = h(t),  t e G. 

x = 0.  re  cG, 


(26) 


where  G is  a bounded  domain  in  £'  with  smooth  boundary  rG,  and  E is 
the  elliptic  operator 

Ex  = X (-1  pD^^x), 

1*1.  I/J!  S m 

r — (f1 i')eG.  ajp  = api  are  bounded  real-valued  functions  on  G.  with 

aip  uniformly  continuous  on  G for  |a|  = |/f|  = m , and 

X axfft+P'  Qy'  + IK  > c(c  i 2 + •••  + t*r 

1*1  ~\P\=m 

for  all  (c, cje  £'  and  some  c > 0.  In  (26).  a is  an  eigenvalue  with 

eigenspace  W — {wj  of  (finite)  dimension  k > 1 for  the  underlying  linear 
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problem  Ex  = 0 on  C,  x = 0 on  cG.  Also,  h e L2(G),  and  g:  E 1 — * E1  is  a 
given  continuous  function  for  which  the  limits  R = g(  + go).  r = g(  — oc.) 
exist  and  are  finite,  and  r < g(s)  < R for  all  s real. 

For  any  eigenfunction  we  W.  w # 0,  let  G f , G~  denote  the  sets  of  points 
teG  where  w(t)  > 0,  w(t)  < 0,  respectively,  and  let  w+ = jG*  |w|  df, 
w~  = JG  |w|  dt.  Then 

rw+  — Rw~  < (/?,  w)  < Rw+  — rw~ 

is  a necessary  condition  for  (26)  to  have  a weak  solution  x e Hm(G). 
Moreover, 

rw+  — Rw~  < (/i,  w)  < Rw+  — rw~ 
is  a sufficient  condition  for  (26)  to  have  a weak  solution  x e Hm(G). 

This  theorem  was  proved  by  Landesman  and  Lazer  [21]  for  m = 2,  k = 1. 
and  extended  by  Williams  [38]  as  stated.  Both  proofs  are  based  on  the 
alternative  method  and  the  use  of  the  Schauder  fixed-point  theorem  applied 
to  the  map  S,  x S0  x S0  represented  by  the  system  of  auxiliary  and 
bifurcation  equations.  [Other  extensions  have  been  given  by  Necas  again  by 
Schauder's  theorem  applied  to  the  same  map,  and  by  Franchetta  by  the 
use  of  Leray-Schauder  degree  associated  to  the  same  map  (coincidence 
degree).]  Shaw  [36]  has  recently  extended  in  various  ways  the  sufficiency 

condition  for  problems  of  the  form  Ex=f(t,  x,  Dx Dm_1.v)  on  G, 

Bx  = 0 on  r'G,  £ and  B as  in  Section  1 1.  Under  hypotheses  Shaw  has  shown 
that  the  auxiliary  equation  can  be  solved  by  Schauder’s  fixed-point  theorem 
yielding  a set-valued  function  £(.x*),  and  then  the  bifurcation  equation  is  also 
solvable,  as  Shaw  has  proved  by  algebraic  topology  argument  and  exact 
sequences  of  cohomology  groups.  (Also,  De  Figuereido,  Fucik,  and  Hess, 
and  Fucik,  Kucera,  and  Necas  have  given  independent  proofs  of  the  Landes- 
man and  Lazer  theorem,  and  parallel  results.) 
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1.  Introduction 

Topological  arguments,  and  in  particular  degree  theory , are  now  quite 
classical  in  existence  problems  for  nonlinear  equations  and,  during  these  last 
decades,  progress  in  this  field  has  followed  very  closely,  and  often  suggested, 
the  development  of  some  parts  of  topology.  Let  us  recall  that  in  1883, 
Poincare  [59],  motivated  by  problems  of  celestial  mechanics,  had  already 
used  Kronecker's  index  [32]  to  give  a formulation  of  the  so-called  Miranda 
theorem  [54]  and  that  Bohl  [2]  in  1904  anticipated  the  Brouwer  fixed-point 
theorem  [4]  in  his  work  on  differential  equations  of  mechanical  systems. 
The  applications  of  Poincare  and  Bohl  were  local  in  nature  and  we  had  to 
wait  for  Lefschetz  [38]  and  Levinson  [41]  in  1943  to  see  the  Brouwer  fixed- 
point  theorem  applied  to  periodic  solutions  problems  for  strongly  nonlinear 
forced  Lienard  equations.  From  this  moment,  Poincare  s method  of  reducing 
the  search  of  periodic  solutions  to  that  of  fixed  points  for  the  operator  of 
translation  over  one  period  and  its  study  by  topological  theorems  were 
used  successfully  by  many  authors  (see,  e.g..  the  books  of  Sansone  and 
Conti  [66],  Lefschetz  [39],  Reissig  et  al.  [62],  and  Krasnosel’skii  [31]). 

In  the  meantime,  however,  another  approach  for  boundary  value  problems 
in  nonlinear  equations  had  emerged  from  the  extension  by  Birkhoff  and 
Kellog  [1],  Schauder  [67],  and  Caccioppoli  [9]  of  the  Brouwer  fixed-point 
theorem  to  some  mappings  in  infinite-dimensional  spaces.  This  current 
culminated  in  the  famous  paper  of  Leray  and  Schauder  [40]  on  degree  theory 
for  compact  perturbations  of  identity  in  Banach  spaces,  whose  birth  was 
closely  related  to  nonlinear  boundary  value  problems  for  elliptic  partial 
differential  equations  (see,  e.g..  Cronin  [16],  Ladyzenskaja  and  Ural'ceva 
[34],  Browder  [5]).  Curiously,  the  machinery  of  Leray-Schauder  degree  was 
applied  somewhat  later  to  ordinary  differential  equations  and,  in  particular, 
only  in  1952,  by  Stoppelli  [71],  to  a problem  of  periodic  solutions  (see, 
e.g.,  Rouche  and  Mawhin  [63]  for  a bibliography  of  more  recent  work).  Up 
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to  that  time,  all  the  global  existence  theorems  proved  in  various  fields  by 
Lera>  Schauder  theory  could  be  reduced  to  the  common  abstract  form 

Lx  = iV(x,  1), 

where  L has  an  inverse,  L 1 N(\  •)  has  compactness  properties,  N[\  0)  = 0, 
and  all  possible  solutions  of  each  equation 

Lx  = N(x,  /),  / e JO,  1[ 

are  a priori  bounded  independently  of  /.  This  abstract  situation  is  usually 
known  as  the  Leray  Schauder  continuation  method. 

Problems  corresponding  to  equations 

Lx  = Nx  (1.1) 

with  L noninvertible  but  ker  L finite  dimensional  had,  however,  received 
attention  in  the  first  decade  of  the  century  in  the  work  of  Lyapunov  [43] 
and  Schmidt  [68]  on  nonlinear  integral  equations.  When  N is  “small”  in 
some  sense,  they  reduced  problems  of  this  type  to  a finite-dimensional 
equation  in  ker  L.  the  bifurcation  equation , a process  referred  to  today  as  an 
alternative  problem.  In  1936,  Caccioppoli  [10]  was  the  first  to  use  topological 
methods, essentially  Brouwer  degree,  to  study  bifurcation  equations,  and  this 
approach  was  substantially  developed  in  1950  by  Cronin  [12]  and  applied 
to  some  local  problems  for  elliptic  partial  differential  equations  [13],  In  his 
basic  paper  extending  alternative  problems  to  some  equations  with  large 
nonlinearities,  Cesari  [11]  also  solved  the  generalized  bifurcation  equation 
using  Brouwer  degree.  Later,  the  author  [44]  used  simultaneously  Cesari’s 
approach,  a priori  estimates,  and  Brouwer  degree  to  give,  in  the  frame  of 
periodic  solutions  of  differential  equations,  a continuation  theorem  for  an 
equation  of  type  (1.1 ) with  noninvertible  L.  Various  proofs  of  this  result,  using 
only  Leray  Schauder  degree,  were  then  given  by  Strygin  [72]  and  the 
author  [45],  who  extended  it  in  [46]  to  operator  equations  in  Banach  spaces. 
Earlier  however.  Lazer  [36]  had  used  the  Schauder  fixed-point  theorem  in 
a subtle  way  to  study  the  periodic  solutions  of  a particular  second-order 
differential  equation  with  a noninvertible  linear  part.  With  Landesman  [35], 
he  applied  the  same  approach  to  a Dirichlet  problem  for  a semilinear 
elliptic  equation,  and  the  result  was  later  generalized  in  various  ways  (see 
Section  5).  In  particular.  Nirenberg  [56]  showed  that  topological  degree  was 
more  adapted  than  Schauder  theorem  for  this  type  of  problem  and  was 
independently  led.  for  elliptic  equations,  to  an  existence  theorem  close  to 
the  spirit  of  Mawhin’s.  With  coincidence  degree  theory  [47],  extending 
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Leray  Schauder  degree  to  some  mappings  of  the  form  L — N,  with  L non- 
invertible,  it  appeared  that  the  author’s  continuation  theorem  cited  above 
was  just  the  natural  extension  in  this  new  frame  of  the  Leray  Schauder 
continuation  method.  Therefore,  it  was  possible  to  unify  many  proofs  of 
old  results  and  to  solve  new  problems,  as  will  be  shown  later. 

It  will  be  the  aim  of  this  work  to  give  a survey  of  recent  existence  results, 
for  operator  or  ordinary,  functional,  and  partial  differential  equations,  which 
fall  into  the  scope  of  coincidence  degree  theory.  As  far  as  the  applications 
are  concerned,  we  have  restricted  ourselves  here  to  equations  with  some 
growth  restriction  on  the  nonlinearities.  For  other  applications  to  ordinary 
and  functional  differential  equations,  as  well  as  for  the  use  of  coincidence 
degree  in  bifurcation  theory,  the  reader  is  referred  to  the  complementary 
survey  papers  [52]  and  [53].  We  also  note  that  the  topological  approach  in 
classical  alternative  problems  can  be  incorporated,  as  shown  in  Section  6, 
into  the  coincidence  degree  in  the  frame  of  2 -contractions  recently  developed 
by  Hetzer  [29],  a subject  that  seems  promising  in  a number  of  applications. 
In  this  chapter,  proofs  are  only  given  when  they  have  not  been  published 
elsewhere  or  when,  because  of  the  attempt  at  unification,  they  differ 
substantially  from  the  original  ones. 

We  hope  that  this  chapter  will  underline  one  of  the  basic  ideas  of  this 
volume  in  showing  how  techniques  in  ordinary,  functional,  and  partial 
differential  equations  can  mutually  influence  each  other  when  integrated  in 
a common  abstract  scheme. 


2.  Coincidence  Degree 

and  a Generalized  Continuation  Theorem 

The  study  of  various  existence  problems  for  differential  equations  with 
non  invertible  linear  part  has  led  the  author  [47]  to  extend  Leray  Schauder 
degree  theory  [40]  to  some  couples  (L,  N)  of  mappings  between  some  vector 
spaces  X and  Z that  we  shall  suppose  here  normed  for  simplicity,  both 
norms  being  denoted  by  | - |.  More  precisely,  let 

(i)  L:  dom  L d X ->  Z be  a linear  Fredholm  mapping  of  index  zero  (i.e., 
such  that  Im  L is  closed  in  Z and 


dim  ker  L = codim  Im  L < oo). 
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(ii)  N:  cl  Qc  X -» Z,  with  ft  open  and  bounded,  a mapping  that  is 
L-compact  on  cl  ft,  i.e.,  such  that 

(a)  n N:  cl  ft -►  coker  L is  continuous  and  n/V(cl  ft)  bounded; 

(b)  KP  Q N:  cl  ft  -*  X is  compact; 

where  FI;  Z -*  coker  L is  the  canonical  surjection  and  KP  Q = KP(I  - Q ), 
with  KP  = (L  | dom  L n ker  P)~  1 and  P:  X ->  X,  Q : Z ->  Z are  continuous 
projectors  such  that 

Im  P = ker  L,  Im  L = ker  Q. 

It  can  be  easily  proved  that  condition  (b)  does  not  depend  on  the  (nonunique) 
choice  of  the  projectors  P and  Q.  Moreover,  if  X = Z and  L = /,  the 
L-compactness  of  N is  nothing  but  its  compactness  on  cl  ft. 

(iii)  When  0 $ (L  - N)( dom  L n bdry  ft),  an  integer  d[(L,  N),  ft],  the 
coincidence  degree  of  L and  N in  ft.  has  been  introduced  in  [47],  which 
depends  only  on  L,  N,  ft,  and  orientations  chosen  on  ker  L and  coker  L.  It 
is  defined  explicitly  by  the  relation 

d[(L  N ),  ft]  = dLS[I  - P - (An  + Kp'Q)N , ft,  0] 

= dLS[l  — (L  + F P q A)_  1 (F p'  Q.  A + N ),  ft.  0] 

where  dLS  denotes  the  Leray- Schauder  degree.  A:  coker  L -*  ker  L is  an 
orientation-preserving  isomorphism,  and 

fp.q.\  = Aa  lp-  Ay  = An | Im  Q. 

It  follows  from  conditions  (i)  (iii)  and  from  the  easily  proved  relation 

I - P - (An  + Kp  q)N  = (An  + Kp  Q)(L  - N ) 

where  All  + KP  Q is  checked  to  be  an  isomorphism  between  Z and  dom  L. 
that  this  Leray  Schauder  degree  is  well  defined,  and  it  is  shown  in  [47]  that 
it  docs  not  depend  on  the  choice  of  P,  Q , and  A.  Moreover,  it  conserves 
most  of  the  basic  properties  of  Leray  Schauder  degree,  as  for  example  the 
additivity,  excision,  and  invariance  with  respect  to  L- compact  homotopies 
preserving  (iii). 

One  can  also  generalize  in  this  frame  the  Leray  Schauder  continuation 
theorem. 


Theorem  2.1.  Let  L and  N satisfy  conditions  (i)  and  (ii)  above  and 
suppose  that  the  following  assumptions  are  verified: 
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1.  For  each  x e dom  L n bdry  and  each  / e JO,  1[. 

Lx  ^ /.Nx. 

2.  For  each  x e ker  L n hdry  Q.  IFVx  # 0 (or  equivalently  QNx  ^ 0). 

3.  dB[ AFI/V  | ker  L.  ii  r ker  L.  0)  / 0 (or  equivalently 

dB[JQN\kcr  L.  il  n ker  L,  0]  ^ 0), 

where  dH  denotes  the  Brouwer  degree  and  J . Im  Q -»  ker  L is  any  isomor- 
phism. 

Then  for  each  / e [0,  1|.  the  equation 

Lx  = /.Nx 

has  at  least  one  solution  in  and 

Lx  = Nx 

has  at  least  one  solution  in  cl  Q. 

lor  a proof  of  this  theorem,  see  (47]  or  [51  J. 

Remark.  When  ker  L = J0J,  conditions  (2)  and  (3)  reduce  to 
2:  0 e il 

as  shown  in  [51]  by  introducing  a Brouwer  degree  in  0-dimensional  spaces, 
and  Theorem  2.1  reduces  then  to  the  usual  Feray  Schauder  continuation 
method. 

3.  Mapping  Theorems  for  Quasi-Bounded  Perturbations 
of  Linear  Fredholm  Operators 

In  this  section,  we  shall  use  Theorem  2.1  to  give  various  mapping 
theorems  relying  on  the  concept  of  quasi-boundedness  for  a nonlinear 
mapping. 

With  X and  / still  real  normed  spaces,  let  / : X -*Z.  The  following 
concept  is  due  to  Granas  [25], 

Definition  3.1.  / is  said  to  be  quasi-bounded  if  the  number  / defined 

by 

I = lim  sup  (|x|  1 1 Fx  | ) 

1*1 -« 

is  finite,  in  which  case  / is  called  the  quasi-norm  of  /■ 
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Throughout  this  section  we  shall  assume  that  L.  dom  Lr  X — > Z is  a linear 
Fredholm  mapping  of  index  zero  and  that  N:  X —>Z  is  L -compact  on 
bounded  sets  of  X Also.  I1  and  Q will  be  continuous  projectors  such  that 
lm  R = ker  /..  Im  /.  = ker  Q.  and  K ,,  (J  is  the  corresponding  mapping  defined 
in  Section  2. 


Theorem  3.1.  Suppose  that  the  following  conditions  are  satisfied: 

(a)  Kh  ,j  N:  X -»  X is  quasi-bounded. 

(b)  There  exist  y.  ^ 0.  r > 0 such  that  each  possible  solution  of 

IFYx  = 0 

satisfies  the  relation 

Rx  | < y.  (!  - R)x  | + r. 

(c)  Kp  qN  < (\  + y) 

(d)  dti[JQN  | ker  I..  IHr)  r ker  L.  0]  / 0. 

Then 

(L  — N)(dom  /,)  o Im  L 
For  a proof  of  this  theorem,  see  [4X]  or  [51]. 

When  X = Z and  /,  = I.  conditions  (b)  and  (d)  are  satisfied  with  y = 0 
and  any  r > 0.  and  assumptions  reduce  t 

V <1. 

This  is  a result  due  to  (iranas  [25] 

To  obtain  a useful  special  case  of  Theorem  3.1,  let  us  suppose  that  X 
is  a (possibly  proper)  subspace  of  the  space  B(S.  Rn ) of  bounded  mappings 
from  some  set  .S’  into  Rn.  with  a norm  such  that 

|x|  > sup  | .X ( s ) | . 

If  V 

In  this  case,  the  following  result  is  proved  in  [51], 


Theorem  3.2.  Suppose  that  the  following  conditions  are  satisfied: 
(a  ) Kh  yN:  X -»  X is  quasi-bounded. 

( b ) There  exists  y > 0 such  that  for  each  u e ker  L and  each  s e S. 

M < *|m(s-)|. 
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(b  ")  There  exists  r,  >0  such  that  for  every  x e dom  L for  which 
|x(.s)|  > /•,,  v e S,  one  has 

IITVx  0. 

(c)  kf  qn j|  < (i  + 7.) 

(d  ) dH[JQN | ker  L.  #(r,)  n ker  L.  0]  ^ 0. 

Then  (L  — N)( dom  L)  3 Im  L. 

The  following  special  case  is  proved  in  [48]  and  [51]. 

Corollary  3.1.  Suppose  that  X is  as  in  Theorem  3.2  with  moreover 

\x\  = sup  | x ( s ) | 

5 € S 

when  x is  a constant  mapping  and  that 

ker  L = {x  e dom  L : x is  a constant  mapping}. 

Suppose  that  assumptions  (a  ),  (b").  (d  ) of  Theorem  3.2  hold  and  that 

\ < l 

Then  (L  — JV)(dom  L)  3 Im  L. 

Let  us  now  come  back  to  a general  normed  space  X . 

Theorem  3.3.  Suppose  that  the  following  conditions  hold: 

1.  There  exist  <)  e [0.  1[,  n > 0.  v > 0 such  that  for  each  x e X. 

\Kp,qNx\  < n\x\*  + v. 

2.  For  every  bounded  V c Im  L.  there  exists  i0  > 0 such  that  for  every 
t > t0 . every  r e V,  and  every  w e ker  L n bdry  B(  1 ).  one  has 

QN(tw  + tA :)  / 0. 

3.  dH \JQN  ker  L.  n ker  L.  0]  ^ 0. 

Then  (/.  - ;V)(dom  L)  3 Im  L. 

Proof.  Let  ye  Im  L.  We  shall  apply  Theorem  2.1  with  Nx  + y instead 
of  Nx.  Each  equation 


Lx  = /.(Nx  + y). 


Ae]().  1]. 


(3.1) 
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is  equivalent  to 


v = aKp  q(N(u  + v)  + y) 
0 - QN{u  + v) 


with 

u = Px,  v = (1  — P)x. 

Then,  for  each  possible  solution  x = //  + v of  (3.2),  we  have 

| v | < n | u + v \s  + v' 


with 

v'  = v + \Kp  Qy\, 

and  hence,  if  | u | ^ 0, 

|u|  |u|”a  < n + /r<5|u|<5_  '(|t’| \u\~d)  + v'| u 
Thus,  there  exists  > 0 such  that,  if  |i/|  > f,, 

n*\ur'<  h 

and  hence 

| v 1 1 1/ 1_<5  < 2(n  + v'tis). 


(3.2) 

(3.3) 


(3.4) 


If 


V = {z  e Im  L:  \z\  <,  2(/i  + v'f,  A). 

then,  by  condition  (2),  there  exists  t0  > 0 such  that,  if  f > t0,  w e ker  L n 
bdry  B(  1 ).  and  z e l< 

QN(tw  + tdz)  * 0. 

Therefore,  if  |w|  > max  (t0,  f,),  |u|  Av  e V,  and 

QN{u  + v)  = QN(\u\(u/\u\)  + |u(V|«H)  / 0 

which  implies  by  (3.3)  that  u + v cannot  be  ? solution  of  (3.1).  Thus,  each 
possible  solution  x = u + v of  (3.1)  is  such  that 

|m|  <t 2 = max  (t0,  f,) 


and  therefore,  by  (3.4), 

M + I®!)4  + v'- 

which  clearly  implies  that  |v|  < t3,  with  t3  the  unique  positive  solution  of 

a - nU2  \ a Y - v'  = 0. 


TOPOLOGY  AND  NONLINEAR  BOUNDARY  VALUE  PROBLEMS  59 

It  is  then  easily  checked  that  all  the  conditions  of  Theorem  3.2  hold  with 
Q = B(t2  + (3),  and  the  proof  is  complete. 

When  S — 0,  it  is  clear  from  the  proof  that  if  assumption  (2)  is  replaced  by 

2'.  There  exists  t0  > 0 such  that  for  every  / > t0,  every  z e cl  B(p  + v), 
and  every  w e ker  L n bdry  B(  I),  one  has  QN(tw  + :)  ^ 0. 

ThenO  e (L  — /V)(dom  L).  The  corresponding  theorem  is  due  to  Cronin  [15], 
Let  us  now  assume  that  X = Z = H,  a Hilbert  space  with  inner  product 
( , ),  that  L is  self-adjoint  (hence  dom  L is  necessarily  dense  in  H),  and  let 
us  give  some  consequences  of  Theorem  3.3  that  were  initially  proved  by 
Necas  [55]  and  Fucik  et  al.  [24], 


Corollary  3.2.  Suppose  that  assumption  (I)  of  Theorem  3.3  and  the 
following  condition 

2".  For  every  bounded  V c Im  L.  there  exists  l0  > 0 such  that  for  every 
t > to-,  every  ; e V,  and  every  w e ker  L n bdry  B(  1),  one  has 

(N(tw  + tAz),  w)  > 0, 

hold.  Then  (L  — N)(dom  L)  =3  Im  L. 

Proof.  Using  the  self-adjoint  character  of  L.  one  can  take 

n 

Px  - Qx  = X (x,  w,)w,  , 

(=  i 

where  (w,,  ....  wj  is  an  orthonormed  basis  of  ker  L.  P and  Q are  then 
orthogonal  projectors  and,  using  (2"),  one  gets 

(QN(tw  + tsz),  w)  = ( N(tw  + t‘sr),  w)  > 0, 

which  shows  that  condition  (2)  of  Theorem  3.3  is  satisfied.  Also,  if  f0 
corresponds  to  V = [0],  then  for  t > t0  and  w e ker  L n bdry  B(  1 ) 

(QN(tw),  w)  > 0. 

which,  by  the  Poincare  Bohl  theorem,  implies  that 

(Ih[QN  | ker  L.  B(t0)  n ker  L,  0]  = 1, 


and  the  proof  is  complete. 
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Corollary  3.3.  Let  F\  H -*H  be  L-compact  on  closed  bounded  sets  of 
H such  that  assumption  (1)  of  Theorem  3.3  holds  with  N.  replaced  by  f . 
Suppose  that  there  exists  S:  ker  L n bdry  B(\) ->  R such  that  for  each 
bounded  V cz  Im  L.  there  exists  tQ  > 0 such  that  for  every  t > t0 . every  re  V. 
and  every  w e ker  L n bdry  B(  1 ),  one  has 

(F(tw  + tdz),  w)  > S(w) 

(S  is  then  said  to  be  a weak  <5-subasymptote  to  F with  respect  to  ker  L).  Then 
for  each  y e H such  that 

(y,  w)  < S(w),  w e ker  L n bdry  B(  1 ). 

the  equation 

Lx  = Fx  — y 

has  at  least  one  solution. 


Corollary  3.4.  Let  F be  as  in  Corollary  3.3  and  suppose  that  for  every 
k > 0 and  every  bounded  K c Im  L there  exists  t0>  0 such  that  for  each 
t > t0 , each  w e ker  L n bdry  B(  1 ).  and  each  z e V,  one  has 

(F(f\v  + tAz),  w)  > k . 

then  L ~ F is  onto. 


Corollary  3.5.  Let  F be  as  in  Corollary  3.3  and  suppose  that  there  exists 
S* : ker  L n bdry  B(\)  -*•  R such  that 

(F(x),  w)  < S*(w).  x e H.  we  ker  L n bdry  B(l), 

and  such  that  for  each  i:  > 0 and  each  bounded  V <=  H.  there  exists  t0  > 0 
such  that  for  every  t > t0 , every  z e V,  and  every  wekerLn  bdry  B(  1 ).  one 
has 

|(F(fw  + tsz),  w)  - S*(w)|  < e.« 

Then 


Lx  = Fx  — y,  ye  FI 


has  a solution  if  and  only  if 


(y,  w)  < s*(w). 


w e ker  L n bdry  B(  1 ). 
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Remark.  The  conditions  in  Corollaries  3.4  and  3.5  being  independent 
of  the  sign  of  L.  the  same  results  hold  for 

Lx  + Fx  = y. 

Let  us  come  back  now  to  normed  spaces  X and  Z but  assume  that 
there  exists  an  inner  product  space  U with  inner  product  ( , ) and  a normed 
space  V with  norm  • ||,  such  that  X c U,  Z <=  U.  and  X cz  V topologically. 
Let  L:  dom  L cz  X -*  Z be  a linear  Fredholm  mapping  of  index  zero  and 
F:  X ->  Z be  L-compact  on  closed  bounded  sets  of  X.  Assume  also  that 
there  exists  a continuous  projector  Q:  Z -*  Z such  that  Im  L = ker  Q and 
which  is  orthogonal  for  the  inner  product  in  U.  The  following  result  has 
been  given  by  Fabry  and  Franchetti  [20]  as  an  application  of  Theorem  2.1. 
We  give  here  a proof  using  Theorem  3.3. 


Theorem  3.4.  Let  us  assume  that  the  following  conditions  hold: 

(i)  There  exist  S e [0,  1[.  H.  p\  v,  v'  > 0 such  that  for  each  .x  e X , 

\Kp'QF\£fi\x\s+  v,  \Fx\£h'\x\6+V. 

(ii)  There  exist  linear  continuous  mappings  H:  X -» Z,  G:  X -*  X,  a 
constant  /f  e [0,  <5[,  with  25  < 1 + (L  and  constants  a > 0,  b > 0 such  that 

(a)  A/ 1 ker  L is  an  isomorphism  onto  Im  Q; 

(b)  G|ker  L is  one-to-one; 

(c)  (Fix,  Nx)  > a||Gx|| 1 +I1  — b for  each  .x  e X. 

Then  if  /?  > 0,  L — F is  onto  and  if  fi  = 0 and  y e Z is  such  that 

sup  \\Gw\\~  l(Hw,  Qy)  < a (3.5) 

»•  6 ker  L n bdry  B(  I ) 

the  equation 


Lx  — Fx  = y 


has  at  least  one  solution. 


Proof.  We  shall  apply  Theorem  3.3  with  Nx  = Fx  + y and  show  that 
condition  (ii)  implies  conditions  (2)  and  (3)  of  this  theorem.  If  (2)  is  not 
satisfied,  then  there  exists  a bounded  V <=  Im  L,  a sequence  ( tn ) with  tn  > 0, 
n e N*.  and  t„  ->  oo  if  n -»  oo,  a sequence  (rn)  with  z„  e V,  n e N*.  and  a 
sequence  (wn)  with  w„  e ker  L n bdry  B(  1 ) such  that 

QF(t„w„  + tnfz„)  + Qy  = 0. 


(3.6) 
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If  necessary,  by  taking  a subsequence  we  can  assume  that  w„  ->  w in  A\  with 
we  ker  L n bdry  B(l),  which  implies  in  turn  that  w„  + t?,~'zn->w  in  X 
when  n -*  oo.  Now,  using  (3.6),  we  have,  for  each  n e N*. 

0 = t~ilJrp)(H(tnwn),  QF(t„  w„  + tnsz„)  + Qy) 

= t;(1+P)(H(tn  w„),  F(tn  w„  + tnsz„)  + Qy)  (by  the  orthogonality  of  Q) 

>t^+^4G(tnWn  + ~K{l  + l>) 

- t;ll+P)(H(tndzn),  F(tnwn  + tnszn)) 

+ t~p(Hwn.  Qy)  [by  the  use  of  (ii-c)] 

> a||G(wn  + C ‘zJH1^  - btn{1  + Pl  -k\H\t2nA-'-p 

x (M'|w„  + C xzn\s  + v't~d)  + t;p(Hwn,  Qy)  [use  of  the 
topological  embedding  of  X and  Z in  U.  of  condition  (i),  and  of  the 
fact  that  (zj  is  a bounded  sequence] 

where  k is  a positive  constant  related  to  the  above  embedding  and  V. 

Therefore,  if  /i  > 0 we  obtain,  if  n -►  oo.  using  the  fact  that  X c=  V, 
topologically, 

0 > a : Gw||1+P, 

which  contradicts  condition  (ii-b).  If  fi  = 0.  we  have  similarly 

0 > a Gw ||  + (Hw.  Qy), 

a contradiction  with  (3.5).  Thus,  condition  (2)  of  Theorem  3.3  is  verified. 

Now  we  deduce  from  (ii-c),  if  x e ker  L,  J = H*,  the  adjoint  of  H | ker  L 
[i.e..  (Hx,  y)  = (x.  H*y)  for  x e ker  L.  >’  e Im  0], 

(Hx,  Fx  + >•)  - (Hx.  QFx  + Qy)  - (x.  J(QFx  + Qy)) 
and.  from  (ii). 

(Hx,  Fx  + y)  > a Gx  1 "p  + b + (Hx.  Qy)  > 0 
if  | x | > r . with  r sufficiently  large.  Then,  using  the  Poincare  Bohl  theorem. 

dB[J(QF(  ) + Qy).  B(r)  n ker  L.  0]  = 1 
and  the  proof  is  complete. 

4.  Periodic  Solutions  of  Ordinary 

and  Functional  Differential  Equations 

Corollary  3.1  can  be  used  to  prove  the  existence  of  T-periodic  solutions 
for  a class  of  retarded  functional  differential  equations.  If  />  0 is  an  integer 
we  shall  denote  by  PTl  the  (Banach)  space  of  mappings  x:  R -*  R"  that  are 
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continuous  and  T-periodic  together  with  their  first  / derivatives,  with  the 
norm 

1*1/  = Z SUP  |*U,(0I 

j = 0 I e K 

with  xij)  = dJx/dt\  and  | • | some  norm  in  Rn.  For  some  r > 0,  let  Cr  be  the 
(Banach)  space  of  continuous  mappings  cp:  [ — r,  0]  -»  R"  with  the  norm 

<P  = sup  \<p(0)\. 

8e[-r.  0) 

When  r = 0,  Cr  will  be  naturally  identified  with  R".  Now.  if  x e P,1  and 
t e R.  and  if  j is  an  integer  between  0 and  /,  we  shall  denote  as  usual  by 
xjj)  the  element  of  Cr  defined  by 

x\J):  [ — r,  0]  -»  Rn,  0^->  xU)(t  + 0), 

and  when  r = 0,  .v}-"  will  be  naturally  identified  with  x(J)(t). 

If  k > 0 is  an  integer,  Ax,  Ak  are  ( n x n)-real  matrices  and 

/:  R x Cr  x x Cr~*  Rn,  ( t , q>x <p i, ....  < Pk+1 ) 

is  T-periodic  with  respect  to  f,  continuous,  and  takes  bounded  sets  into 
bounded  sets,  let  us  consider  the  retarded  functional  differential  equation 

xik  + 1 » + A , x(k)  + • • • + Ak  x'  = f(t,  x, , x,',  ....  xj“).  (4. 1 ) 

The  following  existence  theorem  has  been  proved  in  [49]  as  a consequence 
of  Corollary  3.1  with  X = P k,  Z — P T° , and  L and  N defined  in  an  obvious 
way. 


Theorem  4.1.  Suppose  that  the  following  conditions  are  satisfied: 

1.  The  equation 

det  (/.*/  + /.k~,Al  + •••  + Ak)  = 0 
has  no  root  of  the  form  2n  im  T' 1 with  tn  a nonzero  integer. 

2-  limsup  (||  <p,  +■••+  <pk+ 1 |i )” 1 |/(f,  (Pi (pk+  , ) j = 0, 

<P |||  + -+  I -*X 

uniformly  in  t e R. 

3.  There  exists  R > 0 such  that 

T1  \ ' f(t,y„...,y\k')dt  ^0 
■ o 

for  every  ye  PkT* ' that  verifies  |y(f)|  > R.  t e R. 
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4.  dB[F,  B(R),  0]*0,  with 

. T 

F:Rn->R\  a^T-'  | /(*,  a,  0 0)  dt. 

■ o 

Then.  Eq.  (4.1)  has  at  least  one  T-periodic  solution. 

It  is  shown  in  [49]  that  Theorem  4.1  contains  as  special  cases  or  easy 
consequences  earlier  results  of  Lazer  [36],  Villari  [76],  Sedsiwy  [69],  Reissig 
[61]  for  ordinary  differential  equations,  and  Fennell  [21]  for  retarded 
functional  differential  equations. 

For  the  neutral  equation 

(d/dt)D(t)x,=f(t,x,),  (4.2) 

where  / is  as  above  and  D:  R x Cr  — ► Rn.  (t,  </>)►—►  D(t)q).  is  T-periodic  and 
continuous  with  respect  to  f,  linear  and  continuous  with  respect  to  <p  for 
each  t e R,  and  stable  [which  means  that  the  zero  solution  of  equation 
D(t)y,  = 0 is  uniformly  asymptotically  stable].  Hale  and  the  author  have 
used  Theorem  3.1  (or  equivalently  3.2)  to  prove  the  following  result  [27], 

Theorem  4.2.  Suppose  that  the  following  conditions  are  satisfied: 

1.  lim  sup  w (f>  ~ 1 \f(t.  <p)|  = 0 uniformly  in  t e R. 

2.  There  exists  n>0  such  that  |(Mc)(t)|  > p\c\  for  every  teR  and 
every  ce  Rn.  where  Me  is  the  unique  T-periodic  solution  of  the  equation 
D(t)y,  = c. 

3.  There  exists  Rl  > 0 such  that 

| f(s,  ys)  ds  * 0 
• o 

for  every  ye  PT°  verifying  infteR  |y(f)|  > R i- 

4.  dR[F.  QR,  0]  * 0,  with 

. r 

F:  Rn  R",  T'1  | f(t,(Ma),)dt 

• o 

and 

Q K = \ae  Rn:  Mae  B(R)\.  R = n~'\M\R1. 

Then.  Eq.  (4.2)  has  at  least  one  T-periodic  solution. 

Other  results  concerning  equations  of  type  (4.2).  and  especially  one  arising 
from  a transmission  line  problem,  can  be  found  in  [27], 
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Let  us  now  consider  the  ordinary  scalar  differential  equation 

x”  + m2x  = h{t ) + g(t , x),  (4.3) 

where  m >1  is  an  integer,  h : R -*  R is  27i-periodic  and  continuous,  an  J 
g:  R x R -»  R is  2^-periodic  with  respect  to  t and  continuous.  Using 
Theorem  3.4,  Fabry  and  Franchetti  [20]  have  proved  the  following  theorem. 


Theorem  4.3.  Suppose  that  the  following  conditions  are  satisfied: 

1.  There  exists  r)  e [0,  \[  such  that 

lim  |x|a|^(f,  x)|  =0 
l*|-x 

uniformly  in  t e R. 

2.  There  exists  p > 0 such  that  for  /'  = 1 or  -1,  for  each  t e R , and 
for  each  x e R verifying  |x|  > p.  one  has 

/(sign  x)g(t,  x)  > in  A. 

where 

. 2n  ' 2n 

A = (a2  + b2)'  2,  a = n~  1 | h(t)  cos  nl  dt,  b = n~  1 \ h(t)  sin  nt  dt. 

• o • o 

Then  Eq.  (4.3)  has  at  least  one  27r-periodic  solution. 

This  theorem  generalized  an  earlier  result  of  Lazer  and  Leach  [37],  Other 
existence  theorems  for  second-order  vector  differential  equations  have  also 
been  deduced  from  Theorem  3.4  by  Fabry  and  Franchetti  in  [20], 


5.  Boundary  Value  Problems 

for  Some  Semilinear  Elliptic  Partial  Differential  Equations 

Let  D be  a bounded  domain  in  Rn  and  a„p,  0 < |a|,  \(i\  < m real-valued 
L’  (D)-functions  with  a,p,  for  | a | = \(i\  =m  uniformly  continuous  when 
m > 2.  As  usual,  y.  = (x,,  ....  xj,  x,  e N,  and  |x|  = £"=1  xf.  Let  us  assume 
that  a,p  = aPl  and  that  there  exists  a constant  c > 0 such  that 

I aJt)Q^P>C  |c|2m 

1*1  =»".  \p\  = m 
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for  all  <;  e Rn  and  all  t e D.  Let  H0m(D)  and  Hm(D)  be,  respectively,  the 
completions  of  the  space  Y,0'  (D)  and  C‘(D)  for  the  Sobolev  norm 

\<p\m  = I | |£>V|2 

|*|<m  • 0 

and  let  us  define  the  bilinear  form 

a(u,v)  = £ | a„fi(t)D2u(t)Dpv(t)  dt.  (5.1) 

|i|  S m.  \p\  S m ' l) 

If,  with  V = H0m(D)  or  Hm(D), 

dom  L — [u  e V : v\-*a(u , v)  is  continuous  in  V with  the  L2(D)-norm}, 

then  using  the  fact  that  V is  dense  in  l}(D)  and  the  theorem  of  structure 
of  functionals  in  a Hilbert  space,  there  exists  a linear  (but  not  continuous) 
operator  L:  dom  L c l}(D)  -» 13(D)  such  that  for  u e dom  L and  v e V, 

a(u , v)  = (Lit,  v),  (5.2) 

where  ( , ) denotes  the  inner  product  in  l}(D).  Hence,  if  he  13(D).  each 
equation  in  V of  the  form 

a(u,  u)  = (h,  v ),  Vue  K (5.3) 

is  equivalent  to  the  equation 

Lu  = h.  (5.4) 

It  follows  also  from  the  L2-theory  of  linear  elliptic  boundary  value  problems 
[42]  that  dom  L is  dense  in  L2(D)  and  that  L is  a Fredholm  mapping  of 
index  zero  that  is  self-adjoint  and  has  compact  right  inverses  on  Im  L. 

Now  if  m = 1,  V — Hl(D).  and  D has  a Lipschitzian  boundary  with 
sufficiently  small  Lipschitz  constants,  it  follows  from  regularization  results 
of  Stampacchia  [70]  and  Fiorenza  [22]  that  if  he  W(D)  with  p > n,  each 
solution  u e H 1 (O)  of (5.3)  or  (5.4)  is  Holder-continuous  with  some  coefficient 
7 e ]0,  1[  and  that  if  L denotes  the  restriction  of  L to  the  space  L~  '(C°(cl  D)), 
with  C°(cl  D)  the  Banach  space  of  continuous  real  functions  on  cl  D,  then 
L:  dom  L <=  C°(cl  D ) -*  C°(cl  D)  is  also  Fredholm  of  index  zero  and  has 
compact  right  inverses  on  Im  L (for  more  details,  see  [50]  or  [51]).  We  are 
now  ready  to  study  some  semilinear  boundary  value  problems. 

Theorem  5.1.  Let  DcK"bca  bounded  open  domain  whose  boundary 
is  Lipschitzian  with  sufficiently  small  Lipschitz  constants,  a,/.  D -*  R as 
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above,  /,  / = 1,  n , and  f:c\DxR->R  continuous.  Suppose  that  the 

following  conditions  hold: 

1.  There  exists  /J  > 0,  s > 0 such  that  for  each  t e cl  D and  each  x e R , 

|/(t,  x)|  <p |x|  + s. 

2.  There  exists  R > 0 such  that  for  each  y e C°(cl  D)  such  that 

inf  \y(t)\  > R 

1 6 Cl  l) 

one  has 

I'  f(u  y(t))  dt  * 0. 

* D 

3.  f /(L  -R)dt  I ' f(t,R)dt  <0. 

■ D • D 

Then  if  /i  is  sufficiently  small,  the  variational  Neumann  semilinear 
boundary  value  problem 

£ au(t)Dix(t)Djv(t) 

i.J=l 

for  every  ve  HX[D\  has  at  least  one  solution  x that  is  Holder-continuous. 

The  proof,  which  is  given  in  [50]  or  [51],  consists  in  writing  the  problem 
in  the  abstract  form 

Lx  = Nx 

in  C°(cl  D)  with  L defined  above  and  Nx(t ) = f(t,  x(t)),  and  verifying  that 
ker  L is  the  subset  of  dom  L of  constant  functions  on  cl  D and 

lm  L=  x e C°(cl  D):  f x(0*  = o|. 

I • d I 

The  theorem  is  then  a consequence  of  Corollary  3.1. 

The  following  result  is  also  proved  in  [51]. 

Theorem  5.2.  Suppose  that / satisfies  condition  1 of  Theorem  5.1  and 
is  of  the  form 

/(f,  x)  = h(t)  - g(x), 

where  h:  cl  l)  -»  R and  y:  R -*  R are  continuous.  Then  if 

g±  = lim  g{x) 

X~*  ± <30 


dt  = I f(t,  x(t))v(t)  dt,  (5.5) 

• D 
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exist  (possibly  infinite)  and  if  either 

, g_<g(x)<g+  or  g+<g(x)<g_ 

? for  x e cl  D.  a necessary  and  sufficient  condition  for  the  existence  of  one 

solution  for  (5.5),  when  /f  is  sufficiently  small,  is  that  either 

g . < h(,  < g + or  g + < h0  < c/  _ 


with 


h0  — (meas  D)  1 | h. 

■ i) 

Let  us  now  consider  a semilinear  Dirichlet  boundary  value  problem  for 
an  arbitrary  m.  Let  /:  I)  x R -*  R be  a function  satisfying  Caratheodory 
conditions,  i.e.,  such  that 

(i)  For  each  fixed  x e R.  the  function  ft— »/(f,  x)  is  measurable  in  I) 

(ii)  For  fixed  t e I)  (a.e.),  the  function  xi —*f(t,  x)  is  continuous  in  R 

Suppose  also  that  there  exist  constants  g.  > 0,  r)  e [0,  l[  and  a function 
d e L?([))  such  that  for  f e D (a.e.), 

I fU,  x)|  < g\x\*  + d(t). 

This  implies  in  particular  that  the  mapping  N defined  by 

Nx(t)  =.f(U  x(t)) 

is  a continuous  mapping  from  L2(D)  into  itself  and  that 

|/Vx|  <^|x|A+  \d\, 

where  j | denotes  the  l3{D)- norm.  We  shall  use  Corollary  3.2  to  give  a 
different  and  shorter  proof  of  a slight  improvement  of  a theorem  due  to 
De  Figueiredo  (17). 


Theorem  5.3.  Suppose  that  the  assumptions  above  hold  for  /.  If  there 
exist  functions  fu  e L2 11  h e L2  " such  that 

lim  \x\~*f(R  x)  = h±(t),  (5.6) 


and  if  for  all  v s ker  L n bdry  B(  1 ),  one  has 

('  | f7_|u|1+<>0 

■ D.  • 1) 


(5.7) 


TOPOLOGY  AND  NONLINEAR  BOUNDARY  VALLE  PROBLEMS 


69 


with  D±  = {t  e D:v(t)  ^ 0},  then  the  semil inear  variational  Dirichlet  problem 

a(x,v)=  | f(t.x(t))v(t)dt,  (5.8) 

• n 

for  each  v e H0m(D),  with  u(.x,  v)  defined  in  (5.1),  has  at  least  one  solution. 

Proof.  It  follows  from  the  above  discussion  that  (5.8)  is  equivalent  to 
the  abstract  equation  in  L2(D) 

Lx  = Nx 

with  L and  N defined  above,  and  assumption  ( 1 ) of  Corollary  3.2  holds. 
Let  us  show  now  that  condition  (2”)  of  the  same  corollary  is  satisfied.  If 
not,  there  will  exist  a bounded  V c Im  L.  a sequence  (t„)  with  tn>  0 and 
tn  ->  x when  n — ► x,  a sequence  (zn)  with  zn  e K,  n e N*,  and  a sequence 
(w'J  with  w„  e ker  L r bdry  B{  1 ).  such  that 

I /(f,  tnwn(t)  + tnzn(t))wn(t)  (It  <0,  ne  N*.  (5.9) 

* I) 

My  going  if  necessary  to  subsequences,  we  can  assume  that  w„  ->  vv  and 
w„  + t*„  'zn~*  w in  L2(D)  and  wn(f)  — w(f),  w„(r)  + tAn~  lzn(t)  -*  w(f),  a.e.  in  D, 
when  n —*  x . Hence,  for  almost  each  tel),  (resp.  D_),  there  exists  an 
integer  n0(t)  > 0 such  that,  if  n > n0(t), 

wJ0  + tn  'zJ0>  0 (resp.  < 0), 

which  implies  that  a.e.  in  D+  (resp.  I).  ), 

+ t*  ]zn(t))->  x (resp.  -x) 

if  n -*  x.  Now.  for  each  n e N*. 

0 > | t;  *f(t,  tnw„(t)  + t„aznU))wH(t)  dt 

• l) 

= | L 7 ( )h( r)  (It  + I t~*f(  )(w„{t)  - w(t))  dt 

• D • D 

= | t„  *f{  )w(t)dt  + | tn  *f(  )wU)dt  + | tn  Y(  )(wn(f)  - w(t))dt 

• D.  • D • t) 

= //+/  n +/;• 


(5.10) 
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Using  the  Schwarz  inequality  and  the  convergence  in  13(D)  of  (w„  + tA  *zj 
to  w,  we  obtain 

|/n'|  < (|^;2y2(  )^f)  |w„-w| 

< (Hw»  + \A  + t<Ta|d|)|wn  - w|  < M|w„- w|, 

for  some  constant  M > 0,  and  hence  -*0  if  n -*■  x.  On  the  other  hand, 
the  sequence  tnwn(  ) + tnaz„(- )))),  bounded  in  Z?(D),  and  hence  in 

L?(D,),  converges  weakly  in  L2(D±)  to  its  pointwise  limit,  which  is.  using 
(5.6),  equal  a.e.  to  h±  |w|'5.  Hence, 

I t;Af(  )w(t)  dt  -»  ± | h±(t)\w{t)\1+i  dt. 

■ Dt  • D. 

Going  to  the  limit  in  (5.10)  we  finally  obtain,  using  (5.7), 

0>  | h + (t)\w(t)\l  +s  dt  - | fi_(r)|w(r)|I  +A  dt  > 0, 

•0.  • D 

a contradiction.  The  proof  is  now  complete. 

An  easy  consequence  of  Theorem  5.3  is  the  following. 

Corollary  5.1.  Suppose  that  conditions  of  Theorem  5.3  hold  with  d = 0 
and  that  a.e.  in  D and  for  each  x e R, 

M0  </(',*)<  MO-  (5.11) 

Then  condition  (5.7)  with  nonstrict  inequality  and  3 = 0 is  necessary  for 
the  existence  of  one  solution  for  (5.8),  the  same  condition  with  the  strict 
inequality  being  sufficient. 

Proof.  Sufficiency  has  been  proved.  Now,  if  x is  a solution  of  (5.8),  we 
get.  taking  v in  ker  L and  using  the  symmetry  of  the  bilinear  form  a . 

| f(t,  x(0)r(0  dt  = 0, 

* D 

i.e., 

I /(f,  x(0)|00|  dt  - I f(t,  x(0)|00|  dt  = 0, 

• l).  ■ l) 

which  together  with  (5.1 1)  implies 

| /i  + (0|u(0|  dt  — | M0|00|  dt  > 0. 

• o . • D 
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Corollary  5.1  with  m = I is  essentially  the  result  initially  proved  by 
Landesman  and  La/er  [35],  using  in  an  ingenious  but  fairly  complicated 
way  the  Schauder  fixed-point  theorem,  and  improved,  using  the  same  type 
of  argument,  by  Williams  [78]  and  Necas  [55].  Another  proof,  using  a 
perturbation  argument,  has  been  given  by  Hess  [28]  and  developed  by  De 
Figueiredo  [17],  For  various  results  of  the  same  type,  see  also  Fucik  et  al. 
[24],  Fucik  [23],  and  De  Figueiredo  [18], 

Let  us  suppose  now  that  I)  is  a bounded  domain  of  R"  with  smooth 
boundary  and  that  'J'  is  a linear  elliptic  partial  differential  operator  of 
order  m with  smooth  coefficients  acting  on  scalar  functions  satisfying 
“coercive”  (Lopatinsky  Shapiro)  smooth  boundary  conditions 


yfx  = 0 


on  bdry  /J,  expressed  in  terms  of  m/2  differential  operators  of  order  strictly 
smaller  than  m [42].  Then  it  is  known  that  the  operator  1/J  acting  on 
such  functions  is  of  Fredholm  type  and  we  shall  suppose  that  its  index 
is  zero.  We  shall  consider  the  boundary  value  problem 

'JJ  x = f [t,  x),  tel),  tfx  = 0,  t e bdry  I)  (5.12) 

with/:  D x R -»  R continuous  and  having  limits  asx->  ± oo ; for  simplicity, 
we  shall  assume  that 

lim  f(t,  x)  = MO 

X-*  ± X 

uniformly  for  t e I).  The  solutions  of  (5.1)  are  to  be  understood  as  functions 
belonging  to  Hm  P(D),  i.e..  having  generalized  derivatives  up  to  the  other  m, 

which  belong  to  IH(D),  for  every  p < rc.  Let  w,,  wd  (resp.  w,', wd  ) 

be  smooth  functions  spanning  ker  1/'  [resp.  (Im  !/')■  ].  If  a = (a, ad)  is 

any  vector  in  Rd , denote 

d 

X a,  w,  = a • w 
1=  1 

and  define  <p\  RJ  ->  Rd  by 

tpja)  = ) h + {t)w,’(t)  dt  + j h . (t)Wj'(t)  dt,  /=  1 d. 

* a ■ w > 0 * a w < 0 

We  arc  now  ready  to  formulate  some  results  for  (5.12),  which  are  due  to 
Nirenberg  [56], 
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Lemma  5.1.  Suppose  that  the  following  condition  holds: 

(UC)  The  only  solution  of 

i^w  = 0,  .'4w  = 0 on  bdry  D 

that  vanishes  on  a set  of  positive  measure  in  D is  w = 0. 

Then,  if  Sd  1 is  the  unit  sphere  in  R“,  the  mapping  cp:  Sd  1 ->  Rd  is 
continuous. 

The  proof  of  this  lemma  is  lengthy  and  can  be  found  in  [56],  It  depends 
on  the  following. 

Proposition  5.1.  Under  the  assumptions  of  Lemma  5.1.  one  has 
lim  |<J>,(u,  ra)  — (Pi(a)\  =0  (r  > 0) 

r-*  x 

uniformly  for  u bounded  in  Ll(D)  and  a e \ where 

<!>,(*/,  ra)  = | f(t , ra  • w(t ) 4-  u(f))w,'(/)  dr,  i = 1 d. 

• u 

We  now  have  Nirenberg's  theorem,  which  we  prove  in  a different  way. 
using  Theorem  3.3. 

Theorem  5.4.  Assume  that  the  assumptions  above  and  condition  (UC) 
hold.  Then  if  0 $ (p[Sd  ‘)  and  if  dh[(p.  B{  1),  0]  ^ 0,  where  <p  is  any  continuous 
extension  of^S*  1 to  cl  B(  1),  then  problem  (5.12)  has  at  least  one  solution. 

Proof.  To  apply  Theorem  3.3  we  first  formulate  problem  (5.12)  in  an 
abstract  way  by  taking  X = C°(cl  D)  with  the  uniform  norm  and,  for  some 
fixed  p > n,  dom  L = J.x  € Hm  P{D) : rtx  — 0).  From  the  theory  of  linear 
elliptic  problems  [42].  it  is  known  that  dom  L c C°(cl  D),  with  the 
canonical  injection  compact.  Let 

L:  dom  L-L^D),  x^&x,  N:  C°(cl  D)  C°(cl  D),  x^/(-,x(  )), 

dom  L = L~  ‘(C0(cl  D)),  L:  dom  L - C°(cl  D),  xi -*&z. 

It  follows  from  the  regularity  theory  of  linear  elliptic  problems  [42]  that 
(L.  N ) satisfies  the  required  Fredholm  and  compactness  assumptions  and 
that  each  solution  in  dom  L of 


Lx  = Nx 


(5.13) 
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is  a solution  of  (5.12).  Now,  condition  (1)  of  Theorem  3.3  clearly  holds 
with  S = 0.  Now,  taking  Q to  be  the  projector  such  that  Im  Q = 
spanfwj',  ...,  wd)  and  ker  Q = 1m  L,  and  using  the  fact  that  0 <p(S‘,~1) 
and  Proposition  5.1,  we  see  that  for  every  bounded  V in  Im  L,  there  exists 
r0  > 0 such  that  for  each  r > r0,  each  z e V,  and  each  a e Sd~  \ one  has 

QN(ra  • w + z)  / 0. 

which  is  essentially  condition  (2)  of  Theorem  3.3.  Now,  if 
0 < fi  < inf  \(p(a)\  = inf  |<p(a)| 

aeS"  ‘ ae  S'  ' 

and  if  r,  > 0 is  so  large  that 

sup  | <D(0,  rla)  — (p(a)\  <p, 

ae  S'  1 

we  have,  by  Rouche’s  theorem  that 

dB[JQN  | ker  L,  B(r,),  0]  = dB[ <D(0,  r,  •),  B(l),  0]  = dB[q>,  B(  1),  0]  / 0, 
where  J : Im  Q ->  ker  L,  i w-  i— * Yj=  i k,  wt , and  the  proof  is  complete. 
For  extensions  to  elliptic  systems  of  differential  equations,  see  [56]. 


6.  Coincidence  Degree  and  Alternative  Problems 

Very  recently,  Hetzer  [29]  has  shown  that  when  X and  Z are  Banach 
spaces,  the  coincidence  degree  theory  sketched  in  Section  1 could  be 
developed  in  a similar  way  as  in  [47]  by  using,  instead  of  the  Leray- 
Schauder  theory,  the  more  general  degree  for  a-contractions  due  to 
Nussbaum  [57],  Borisovich  and  Sapronov  [3],  Sadovskii  [65],  and  others. 
We  shall  exhibit  in  this  section  the  main  aspects  of  Hetzer’s  article  and 
then  develop  its  relations  with  alternative  problems. 

Let  us  first  recall  that  if  Y is  a metric  space  and  B a subset  of  Y,  the 
(Kuratowski)  measure  a (B)  of  noncompactness  of  B [33]  is  defined  by 

a (B)  = inf  {d  > 0 : B has  a finite  cover  by  sets 

having  a diameter  smaller  than  d). 

If  K,  and  Y2  are  metric  spaces,  a continuous  mapping  /:  T,  -»  Y2  will  be 
said  to  be  an  ot-contraction  (or  a k-set  contraction)  if  there  is  a k s [0,  1[ 
such  that,  for  each  bounded  set  B c Fj,  one  has 

«(/(«))  < MB). 
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When  Y2  is  a Banach  space  and  Yj  a subset  of  a Banach  space,  an  example 
of  a-contraction  /:  Y,  -»  Y2  is  given  by  / = j\  + f2,  with  /,  Lipschitzian  of 
constant  k e [0,  1[  and  f2  compact  on  Yj. 

Now  let  X and  Z be  real  Banach  spaces  and  L:  dom  LcX-*Za  linear 
Fredholm  mapping  of  index  zero  for  which  notations  of  Section  2 will  be 
conserved.  If  ft  <=  X is  open  and  bounded,  we  shall  say  that  the  mapping 
N : cl  ft  -» Z is  an  L-a-contraction  if : 

(a)  UN:  cl  ft  ->  coker  L is  continuous  and  n A7(cl  ft)  is  bounded. 

(b)  KP  Q N:  cl  ft  -+  X is  an  oi-contraction. 

By  noting  that,  if  P,  Q are  other  continuous  projectors  such  that 
Im  P = ker  L,  Im  L = ker  Q, 
we  have  (cf.  [47]  or  [51]) 

Kp^N  = (I-  P)KP  qN  + (I-  P)KP(Q  - Q)N, 

It  is  not  hard,  using  some  of  the  basic  properties  of  the  measure  of  non- 
compactness, to  check  that  Kp  QN  is  an  a-contraction  with  the  same 
constant.  Thus,  the  concept  is  independent  of  the  choice  of  projectors  P 
and  Q. 

Moreover,  if  we  now  assume  that 

0 $ (L  - N)( dom  L n bdry  ft),  (6. 1 ) 

then  we  can  still  define  the  coincidence  degree  of  L and  N in  ft  by 

d[(L,  N),  ft]  = d,[l  - P - (An  + Kp , q)N,  ft,  0],  (6.2) 

where  dt  denotes  the  degree  for  perturbations  of  identity  by  a-contractions. 
Hetzer  has  shown  in  [29]  that  the  argument  of  [47]  for  proving  the 
invariance  of  the  right-hand  member  of  (6.2)  with  respect  to  P , Q and 
orientation-preserving  isomorphisms  A:  coker  L -*  ker  L still  holds  in  this 
new  frame.  Moreover,  Hetzer  [29]  has  also  shown  that  the  basic  properties 
of  coincidence  degree  are  preserved  in  this  extension,  as  well  as  the  generalized 
continuation  theorem  of  [47]  recalled  in  Section  2 and  the  generalized 
Granas  theorem  [48]  recalled  in  Section  3.  An  interesting  application  of 
those  existence  theorems  would  be  the  existence  of  T-periodic  solutions 
(or  of  other  boundary  value  problems)  for  neutral  functional  differential 
equations  of  the  form 


x'(t)=f(t,  xf,x/) 
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with/T-periodic  in  r,  continuous,  taking  bounded  sets  into  bounded  sets, 
and  Lipschitzian  of  constant  k e [0,  1[  with  respect  to  the  third  variable 
(see  Sadovskii  [64]). 

It  is  clear  that  very  useful  sufficient  conditions  for  the  L-compactness  of 
a mapping  N are  either  that  KP  is  continuous  and  N compact  or  KP  compact 
and  N continuous  and  taking  bounded  sets  into  bounded  sets.  An  interesting 
sufficient  condition  for  the  L-a-contractive  character  of  N has  been 
introduced  by  Hetzer  in  [29],  when  L is  a closed  Fredholm  mapping.  He 
introduces  the  (finite)  number. 

/(L)  = sup  {r  e R+  : for  each  bounded  B <=  dom  L,  r<x{B)  < a (L(B)). 

He  then  proves  the  following: 

Proposition  6.1.  A sufficient  condition  for  the  a -contraction  N: 
cl  ft  -»  Z,  with  constant  k,  to  be  an  L-a-contraction  on  cl  ft  is  that  k e [0,  /(L)[. 

We  shall  also  note  that  the  number  /(L),  which  is  defined  for  closed  <I>  + 
and  <D_  operators,  has  been  used  by  Hetzer  in  [29]  to  strengthen  and  unify 
some  known  results  on  the  perturbations  of  semi-Fredholm  operators. 

Let  us  now  develop  the  relations  of  this  coincidence  degree  with  the 
classical  alternative  problems  [26,  Chapter  IX]  when  condition  (b)  above 
in  the  definition  of  L-a-contraction  is  replaced  by  the  stronger  condition 

(b  ) There  exists  k e [0,  1[  such  that  for  each  x,  y e cl  Q, 

I Kp,q(Nx  - Ny) | < k\x  - y\. 

Then,  as  shown  by  Browder  in  [7],  the  mapping  H : c\Q~*  X defined  by 

H = I - KP  qN 

belongs  to  a convex  class  of  permissible  homeomorphisms  [6],  i.e.,  of  homeo- 
morphisms  on  cl  fl  such  that  H is  a homeomorphism  of  Q onto  an  open 
set  //(ft)  of  X that  maps  cl  ft  homeomorphically  onto  cl  (//(ft)).  Also,  as 
easily  checked,  H preserves  the  fibers  of  P,  i.e., 

PHx  = Px,  x € cl  ft, 

which  implies  at  once  that  H~ 1 : cl  //(ft)  -*  X also  preserves  the  fibers  of  P, 

i.e., 


PH  l(x)  = Px,  x eel  ft. 


(6.3) 
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One  has  also,  if  Z = ft  or  cl  ft, 

PH(  I)  = P(Z),  H(  I)  nlmPc  P(Z).  (6.4) 

We  can  now  prove  the  following  basic 


Theorem  6.1.  If  conditions  above  hold,  then 

d[(L,  N),  ft]  = </„[-  AI\NH~  1 1 ker  L,  //(ft)  n ker  L,  0],  (6.5) 

Proof.  If 

F = 1 - P - (An  + KPQ)N,  c=-aun-p, 

V(x,  y)  = Kp  Q Nx  + An Ny  + Py,  S(x,  y)  = x - V(x , y), 

then  V satisfies  the  conditions  of  Section  1 in  Nussbaum  [57],  and  hence 

d[(L,  N),  «]  = da[I  -!/(•,  ),  Q,  0]  = degAf[P,  Q,  0],  (6.6) 

where  M is  the  convex  class  of  strict-contractive  perturbations  of  identity 
and  degM  [P,  Q,  0]  is  the  Browder-Nussbaum  degree  [6,  8]  defined  here  by 

degM  [F,  fi,  0]  = degM  [[P,  Sj,  Q,  0]  = dLS[I  + H~  ‘C,  -C~  *(//(Q)),  0], 

On  the  other  hand,  we  are  also  in  a position  to  apply  Theorem  3 of 
Browder  [6]  and  hence 

degM  [P,  Q,  0]  = deg,  [[P,  HI  fi,  0],  (6.7) 

where  the  right-hand  member  is  the  Browder  degree  defined  in  [6]  by 

deg,  [[P,  HI  Q,  0]  = dLS[I  + CH~\  H(Cl ),  0],  (6.8) 

Now,  using  (6.3)  and  the  definition  of  Leray-Schauder  degree, 

dLS[l  + C7T  ',  //(«),  0]  = dLS[l  ~ P - A I1NH- ',  //(fi),  0] 

= dH[  - \Y\NH ~ ' | ker  L,  H(Cl)  n ker  L,  0],  (6.9) 

and  the  result  follows  from  Eqs.  (6.6)-(6.9). 

Theorem  6. 1 can  be  improved  if  one  makes  a supplementary  assumption, 
which  is  classical  in  alternative  methods. 

Theorem  6.2.  If  conditions  of  Theorem  6.1  hold  and  if  for  each 
ae  P(cl  fl),  the  mapping  Ta  = x\~*a  + KP  QNx  is  a strict  contraction  of 
cl  ft  into  itself  such  that,  if  a e P(ft),  7^(ft)  cz  ft,  then 

d[(L  N),  ft]  = dB[- AflA/P  | ker  L,  P(ft),  0] 
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where  R(a)  is  the  (unique)  solution  of  the  equation 

y = a + KP  QNy. 

Proof.  By  the  assumption  and  the  Banach  fixed-point  theorem,  R(a) 
exists  and  is  unique  for  each  a e P(c\  Q),  R:  P( cl  Q)->A'  is  continuous, 
and  R(a)  e Q if  a 6 P(Q).  Also,  necessarily,  R(a)  = H~l(a)  for  a e P( cl  Q), 
which  implies  that  H(Q)  r>  P(Cl)  and  therefore,  using  (6.4),  that  P(Q)  = 
H(Q)  n ker  L.  TJie  result  then  follows  from  Theorem  6.1. 

If  we  note  that  AIIAf  can  be  written  JQN  with  J : Im  Q ->  ker  L some 
isomorphism,  we  note  that  — AYINR  is  nothing  but  the  mapping  defining 
the  bifurcation  equations  of  the  corresponding  alternative  method.  Hence, 
Theorem  6.2  explicitly  relates  the  coincidence  degree  of  L and  N in  fi  with 
the  Brouwer  degree  of  the  bifurcation  mapping  at  zero  in  P(Q).  It  contains, 
as  special  cases,  results  of  O’Neil  and  Thomas  [58]  and  Thomas  [73,  74] 
relating  Browder-Nussbaum-Petryshyn  degrees  to  Cronin’s  multiplicity 
[12,  14],  which  is  nothing  but  the  Brouwer  degree  of  the  bifurcation 
mapping  of  an  alternative  problem  with  L = / - A,  A compact.  See  also 
Williams  [77]  for  a corresponding  result  in  the  line  of  Cesari’s  approach. 


7.  Nonlinear  Perturbations 

of  Fredholm  Mappings  of  Nonzero  Index 

All  the  results  exposed  up  to  now  concern  nonlinear  perturbations  of 
linear  Fredholm  mappings  with  zero  index.  When  Ind  L > 0,  one  can  still 
find  linear  one-to-one  mappings  A:  coker  L -*  ker  L,  but  the  following  fact, 
noted  in  [47]  and  [56],  makes  necessary  the  use  of  topological  tools  more 
sophisticated  than  the  degree. 

Proposition  7.1.  If  L : dom  L cr  X ->  Z,  with  X,  Z normed  spaces,  is  a 
Fredholm  mapping  with  Ind  L > 0,  if  N\  cl  fi  c X -*  Z is  L-compact  with 
n c X open  and  bounded,  and  if  0 i (L  - Af)(dom  L n bdry  Q),  then  for 
each  A as  above, 

- p - (An  + kp , q)n , n,  o]  = o. 

This  “negative”  property  has,  however,  been  used  by  Rabinowitz  [60] 
to  prove  an  existence  theorem  for  some  semilinear  elliptic  boundary  value 
problems  that  we  shall  just  translate  here  in  abstract  form,  referring  to  the 
original  paper  for  applications  and  further  comments. 


78 


JEAN  MAWHIN 


Theorem  7.1.  If  L:  dom  L c.  X -*■  Z is  a Fredholm  linear  mapping  with 
Ind  L > 0,  and  N:  X -»Z  is  L-compact  on  closed  bounded  sets  of  X and 
odd,  then  for  each  open,  bounded,  symmetric  neighborhood  ft  of  0,  there 
exists  at  least  one  x e dom  L n bdry  ft  such  that  Lx  = Nx. 

Proof.  If  the  theorem  is  not  true,  there  exists  an  open,  bounded, 
symmetric  neighborhood  ft  of  0 such  that  0 $ (L  - N)(d om  L n bdry  ft) 
and  hence,  using  Borsuk’s  theorem  for  degree  of  odd  mappings, 
dLS[I  — P — (An  + KP  q)N,  ft,  0]  is  odd,  a contradiction  with  Proposition 
7.1. 

Rabinowitz  [60]  has  also  proved  that,  under  conditions  of  Theorem  7.1, 
the  set  of  zeroes  of  L — N has  a symmetric  unbounded  component  con- 
taining 0 and,  using  Krasnosefskii’s  concept  of  genus  [30],  has  more 
information  on  this  set. 

We  shall  now  state  without  proof  an  interesting  topological  result  due 
to  Nirenberg  [56].  Let  A'  be  a Banach  space,  P:  X -»  X a continuous 
projector  with  range  of  dimension  m > 0,  d>:  Im  P n cl  B(r)  ->  Y c Im  P a 
continuous  mapping  such  that  0 d>(Im  P n bdry  B(r)),  with  Y a subspace 
of  Im  P of  dimension  p < m.  Let  Z:  Sm~ 1 -*  Im  P n bdry  B(l),  I': 
Y n bdry  B(l)  -» Sp~ 1 be  isomorphisms,  and 

TiS™-1  -*Sp-\  u (— » Z'<D(rZw)/ 1 <D(rZw)  | 

with  Sk~  1 the  unit  sphere  in  Rk. 


Proposition  7.2.  If  has  nontrivial  stable  homotopy,  then  every  map- 
ping F:  cl  B(r)  -*  X such  that  F = F(-,  1)  for  some  compact  mapping 
F:  cl  B(r)  x [0,  1]  -*•  X verifying 

x # F(x,  A),  x e bdry  B(r),  k 6 [0,  1],  F(\ 0)  = P + <I>P 

has  at  least  one  fixed  point  in  B(r). 

This  result  is  used  in  [56]  to  extend  Theorem  5.4  to  elliptic  operators 
W with  Ind  > 0.  In  this  assertion,  the  nonvanishing  of  the  Brouwer 
degree  of  <p  is  replaced  by  the  assumption  of  nontriviality  for  the  stable 
homotopy  of  the  mapping  ¥:  Sd~1  ->  Sd  ~l  defined  by  4*  = </>/ 1</>|,  with 
d'  = dim  coker  iF.  Some  further  results  are  given  by  Cronin  in  [15].  More 
generally,  Proposition  7.2  can  be  used  to  give  the  following  extension  of 
Theorem  2.1,  with  a quite  similar  proof. 
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Theorem  7.2.  Let  X,  Z be  Banach  spaces;  let  L:  dom  L c X -*  Z be  a 
linear  Fredholm  mapping  with  Ind  L > 0;  let  N:  cl  B(r)  c X ->  Z be  L- 
compact;  and  suppose  that  conditions  (1)  and  (2)  of  Theorem  2.1  hold  with 
ft  = B(r).  Then,  if  the  mapping 

'Y:Sm-1  -Sp-‘,  uh^S'AnAr(rIu)/|AnN(rIu)| 

with  m = dim  ker  L,  p = dim  coker  L,  E:  Sm_  1 -*•  ker  L n bdry  B(l),  E': 
Im  P n bdry  fl(l)-»Sp  1 some  isomorphisms,  has  nontrivial  stable  homo- 
topy,  the  conclusions  of  Theorem  2.1  hold. 

Nirenberg’s  results  for  elliptic  equations  have  been  somewhat  extended 
by  Tromba  [75]  when  / is  smooth,  by  the  use  of  the  Elworthy-Tromba 
degree  [19]  for  nonlinear  proper  Fredholm  mappings  on  Banach  manifolds 
based  on  the  Pontryagin-Thom  theory  of  framed  cobordism. 

When  Ind  L < 0,  no  linear  one-to-one  A;  coker  L -*  ker  L exists  [47], 
which  makes  the  problem  very  difficult. 
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Bifurcation  is  a term  used  in  several  parts  of  mathematics.  It  generally 
refers  to  a qualitative  change  in  the  objects  being  studied  due  to  a change 
in  the  parameters  on  which  they  depend.  We  are  interested  in  the  set 
of  solutions  of  functional  equations,  and  for  the  examples  we  have 
in  mind  the  following  more  precise  definition  suffices.  Let  X and  Y be 
Banach  spaces,  U a X,  and  F:  U -»  Y.  Suppose  there  is  a one-to-one  curve 
./  = {x(f)|f  e (0, 1 )}  c:  U such  that  F(z)  = 0 for  z e ./ . A point  we./  is  called 
a bifurcation  point  for  F with  respect  to  ./  (or  more  simply  a bifurcation 
point ) if  every  neighborhood  of  w contains  zeros  of  F not  on  ./ . In  applica- 
tions,  after  possibly  makinga  change  of  variables,  one  usually  has  X = Rx£ 
with  E a real  Banach  space,  F = F(k,  u),  and  ./  = {(/,  0)|A  e {a,  b)  c IR}.  We 
restrict  our  attention  to  this  case.  The  members  of ./  will  be  called  trivial 
solutions  of  F(k , u)  = 0.  Thus  we  are  interested  in  nontrivial  zeros  of  F. 

Much  of  the  motivation  for  studying  bifurcation  is  provided  by  varied 
phenomena  in  the  physical  sciences,  which  can  be  formulated  in  these  terms. 
We  illustrate  with  some  ol  the  standard  model  cases. 

I.  Thermal  Convection — The  Benard  Problem 

An  infinite  horizontal  layer  of  a viscous  incompressible  fluid  lies  between 
a pair  of  perfectly  conducting  plates.  A temperature  gradient  T is  maintained 
between  the  plates,  the  lower  plate  being  warmer.  If  T is  appropriately 
small,  the  fluid  remains  at  rest,  the  temperature  is  a linear  function  of  the 
vertical  coordinate,  and  heat  is  transported  through  the  fluid  solely  by 
conduction.  However,  if  T exceeds  a certain  value,  the  fluid  undergoes  time- 
independent  motions  called  convection  currents  and  heat  is  transported 
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through  the  fluid  by  convection  as  well  as  conduction.  In  actual  experiments 
the  fluid  breaks  up  into  cells  whose  shape  depends  in  part  on  the  shape 
of  the  container  Mathematically  the  equilibrium  configuration  of  the  fluid 
is  described  by  a system  of  nonlinear  partial  differential  equations.  For  each 
value  of  T.  there  is  a conduction  solution  satisfying  these  equations. 
Formulated  in  the  general  Banach  space  framework,  these  solutions 
correspond  to  the  trivial  solutions,  while  the  value  of  7 at  which  convection 
begins  corresponds  to  a bifurcation  point  For  more  on  the  physical  and 
mathematical  aspects  of  this  problem,  see  [1.  2] 

II.  Rotating  Fluids — The  Taylor  Problem 

A viscous  incompressible  fluid  lies  between  a pair  of  concentric  cylinders 
(whose  axis  of  rotation  is  vertical).  The  inner  cylinder  rotates  at  a constant 
angular  velocity  <a  while  the  outer  cylinder  remains  at  rest.  If  oj  is  sufficiently 
small,  the  fluid  particles  move  in  circular  orbits  with  velocity  depending 
on  their  distance  from  the  axis  of  rotation.  Equilibrium  states  of  the  fluid 
are  solutions  of  the  time-independent  Navier  Stokes  equations.  The  above 
solution  is  called  Couette  flow  and  exists  as  a mathematical  solution  of  the 
governing  equations  for  all  values  of  <o  When  oj  exceeds  a certain  critical 
value,  the  fluid  breaks  up  into  horizontal  bands  called  Taylor  vortices  and 
a new  fluid  motion  periodic  in  the  vertical  direction  is  superimposed  on 
the  Couette  flow.  For  this  example.  Couette  flow  corresponds  to  the  trivial 
solutions  in  the  general  framework  and  the  value  of  m producing  the 
onset  of  Taylor  vortices  corresponds  to  a bifurcation  point  [1.  2], 

ill.  Buckling  Phenomena  in  Elasticity — The  Flat  Plate 

A thin,  planar,  clamped  elastic  plate  is  subjected  to  a compressive  force 
along  its  edges  If  the  magnitude  /.  of  this  force  is  small  enough,  the  plate 
remains  motionless  and  in  equilibrium  However,  if  /.  exceeds  a certain 
value,  the  plate  deflects  out  of  its  plane  and  assumes  a new  nonplanar 
equilibrium  position  called  a buckled  state.  Equilibrium  configurations  of 
the  plate  satisfy  a system  of  nonlinear  partial,  differential  equations  called 
the  von  Karman  equations.  The  unbuckled  state  is  a solution  of  these 
equations  for  all  values  of  / and  corresponds  to  the  trivial  solutions,  while 
the  value  of  /.  at  which  buckling  occurs  corresponds  to  a bifurcation 
point  [3  5] 

These  examples  have  all  dealt  with  bifurcation  of  time-independent 
solutions  of  a functional  equation  to  new  time-independent  solutions  Of 
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course,  more  complicated  kinds  of  bifurcation  may  occur,  such  as  bifurcation 
of  time-independent  to  time-periodic  solutions.  This  occurs  in  particular  for 
the  Taylor  problem.  There  one  finds  experimentally  that  if  o is  in  the 
Taylor  vortex  flow  regime  and  is  increased  still  further,  another  bifurcation 
point  is  reached  where  the  Taylor  vortices  are  replaced  by  a new  How 
sometimes  called  wavy  vortices  [6],  which  is  periodic  both  in  the  vertical 
direction  and  in  time. 

There  is  a considerable  literature  on  bifurcation,  both  for  particular 
physical  problems  and  for  the  general  theory.  Several  examples  of  the  former 
are  contained  in  [1]  and  [4],  As  references  for  the  latter,  we  suggest  the 
survey  papers  of  Vainberg  and  Trenogin  [7],  Prodi  [8],  Sather  [9]. 
Kirchgassner  and  Kielhofer  [2],  and  Stakgold  [10]:  the  books  of  Krasnoselski 
[11]  and  Antman  and  Keller  [4]:  and  the  lecture  notes  of  Pimbley  [12], 
Sattinger  [13],  looss  [14],  and  Nirenberg  [15].  For  a more  general  view 
of  bifurcation,  we  refer  the  reader  to  the  survey  paper  of  Arnold  [16].  Several 
other  references  will  be  given  below  but  no  attempt  will  be  made  to  give 
a complete  bibliography. 

Returning  to  the  general  theory,  there  are  three  main  questions  of  interest: 

(i)  What  are  necessary  and  sufficient  conditions  for  (/c  0)g  ./  to  be  a 
bifurcation  point? 

(li)  What  is  the  structure  of  the  set  of  zeros  of  F(/.,  u)  near  (/c  0)? 

(in)  In  physical  problems  such  as  Till  above  where  there  is  an 
underly  ing  evolution  equation  of  which  the  solutions  described  are  equilib- 
rium solutions,  determine  which  solutions  are  stable  or  unstable. 

This  survey  will  mainly  address  (i)  and  (ii)  since  (iii)  involves  rather 
different  techniques.  Moreover,  we  will  focus  on  bifurcation  of  time- 
independent  solutions  of  F(/.,  u)  — 0 to  new  time-independent  solutions, 
although  at  the  end  of  our  survey  some  other  cases  will  be  discussed.  Both 
analytical  and  topological  methods  have  been  employed  to  study  (i)  and 
(ii)  and  we  begin  with  the  former. 

For  what  follows,  Ck(U.  V)  denotes  k times  continuously  Predict 
differentiable  maps  from  U to  V.  The  null  space  of  a linear  map  is  denoted 
by  \'(L)  and  its  range  by  R(L).  The  dimension  of  a subspace  Z of  E or  Y 
will  be  denoted  by  dim  Z. 

Suppose  if  is  a neighborhood  of  0 in  E and  F e C((u,  b)  x Q.  V ) with 
F(/..  0)  = 0 for  /.  g (u.  h).  If  further  F is  Frechet  differentiable  with  respect 
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to  u at  (/,  0)  and  £„(/,  0)€  C((a , h),  T),  then  a necessary  condition  that 
(/L  0)  be  a bifurcation  point  is  that  Fu(n , 0)  is  not  an  isomorphism  from  £ 
to  Y.  Indeed,  the  hypotheses  on  F imply 

(1)  £(/.,  u)  = £„(/.,  0)u  + //(/,  u), 

where  ||  //(/,  n)||  = o(||u||)  at  u = 0.  Therefore  if  £(/.,  u)  = 0 for  (/.,  u)  near 
(n,  0)  and  Fu(n,  0)  is  an  isomorphism, 

(2)  -u  = Fu(l0)-'H(A,u). 

Since  the  right-hand  side  of  (2)  is  o(||m||)  at  u = 0,  (2)  can  be  satisfied  near 
u = 0 only  by  the  trivial  solutions.  If  £e  C‘((u,  b)  x Q,  Y ) the  result  is 
immediate  from  the  implicit  function  theorem. 

This  necessary  condition  is  not  sufficient  as  simple  examples  show.  For 
example,  for  £ = Y = R2, 

(3,  F(A,U)=<>-A)(";)  + (_£). 

and  n = 1,  multiplying  the  first  equation  in  (3)  by  u2,  the  second  by  ut,  and 
subtracting  shows  F has  no  nontrivial  zeros.  Thus,  to  get  bifurcation  more 
must  be  assumed  concerning  FH(p,  0)  or  the  nonlinear  part  of  F. 

If  £ is  C1  near  (/l  0)  and  £u(/l  0)  is  a Fredholm  operator  from  £ to  Y, 
the  bifurcation  problem  can  be  reduced  to  a finite-dimensional  one  via  the 
so-called  method  of  Liapunov-Schmidt.  Namely,  let  Z be  a complementary 
space  to  jV(£u(p,0))  in  £and  write  u = v + w,whereue  N{Fu(n,  0))andwe  Z. 
If  (/,  u)  is  near  (/i,  0)  and  £(/,  u)  = 0,  by  Taylor’s  theorem  we  can  write 

(4)  0 = £(/,  u)  = Fin,  0)  + £,(/i,  0)(A  - n) 

+ F„(n,  0)w  + o(||p||  + || ||  + \k  - n\) 

= Fu(n,  0)w  + H(v,  w,  a), 

where  a = / — n.  Let  Y be  a complementary  subspace  to  R(Fu(n , 0))  in  Y 
and  let  P (resp.  P)  denote  the  projector  of  Y onto  R(Fu(n,  0))  (resp.  Y) 
along  ? [resp.  R(Fu(n,  0))].  Since  £u(p,  0)  is  an  isomorphism  from  Z to 
R(Fu(n,  0)),  by  the  implicit  function  theorem  there  is  a function  w = <p( a,  v) 
satisfying 

(5)  £„(//,  0)w  + PH(v,  w,  a)  = 0 

for  (at,  v)  near  (0,  0)  with  </>( 0,  0)  = 0 and  (p  continuously  differentiable 
near  (0,  0).  Thus,  the  bifurcation  problem  reduces  to  finding  nontrivial 
solutions  of 


(6) 


PH(v,  <p(or,  i>),  at)  = 0, 
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which  represents  dim  Y equations  in  1 + dim  N(Fu(g,  0))  unknowns.  This 
generally  is  still  a formidable  problem  and  cannot  be  solved  without  more 
hypotheses.  See,  e.g.,  [7,  9],  or  [17-19]  for  further  work  on  this  subject. 

A very  useful  special  case  and  one  that  occurs  in  many  applications 
(e.g.,  examples  I and  II  above  and  sometimes  III)  arises  when  0 is  a 
"simple  eigenvalue”  of  Fu(g,  0).  Then  a fairly  complete  description  of 
the  solution  set  of  F = 0 near  (g,  0)  can  be  given.  There  are  various 
versions  of  such  results  (see,  e.g.,  [4,  8,  18-20]).  First  the  notion  of  “simple 
eigenvalue”  will  be  made  more  precise.  Let  B(E , Y)  denote  the  set  of  bounded 
linear  maps  from  E to  Y.  If  L,Ke  B(E , Y)  and  r e R,  we  say  r is  a K-simple 
eigenvalue  of  L if  dim  N(L  — rK)  = 1 = codim  R(L  — rK)  and 

Kvi  R(L  - rK), 

where  v spans  N(L  — rK).  If  E = Y,  L is  compact,  and  K = /,  the  identity 
map,  it  is  easy  to  see  that  g is  an  /-simple  eigenvalue  of  L if  and  only  if 
it  is  a simple  eigenvalue  of  L in  the  usual  sense  (i.e.,  the  algebraic  and 
geometric  multiplicities  of  g are  1). 

Theorem  7.  Suppose  F e C2((a,  h)  x Q,  Y),  £(A,  0)  = 0 for  A e (a,  h), 
g € (a,  b),  and  0 is  an  F ^u(g,  0)-simple  eigenvalue  of  Fu(g,  0).  If  Z is  any 
complement  of  N(Fu(g,  0))  = span{i;}  in  E,  then  there  exists  a neighborhood 
of  U of  (g,  0)  in  R x E,  a 5 > 0,  and  functions  (peC'((-S,  6),  R), 
if  e C‘((  — 6,  <5),  Z)  such  that  <p(0)  = 0,  t/^(0)  = 0,  and 

F~  '(0)  n U = {(A,  0)| (A,  0)e  U)  u {((p(s),  sv  + .vi//(s))|  |s|  < 5}. 

This  theorem  can  be  proved  with  the  aid  of  the  Lyapunov-Schmidt 
procedure  but  it  is  simpler  to  work  directly  and  get  the  existence  assertions 
from  the  implicit  function  theorem.  The  uniqueness  is  a consequence  of 
the  uniqueness  inherent  in  the  implicit  function  theorem  together  with  an 
additional  estimate.  This  version  of  the  “simple  eigenvalue  theorem”  is  due 
to  Crandall  and  Rabinowitz  [21,  22].  For  another  interesting  proof,  see  [15]. 
If  F is  real  analytic,  then  by  the  analytic  version  of  the  implicit  function 
theorem,  (A,  u)  are  real  analytic  in  ,s  near  s = 0 and  may  be  obtained  by 
a formal  expansion  procedure. 

The  smoothness  assumption  on  F may  be  weakened  slightly  [21]. 
Minimal  smoothness  hypotheses  prove  useful  for  technical  reasons  when 
dealing  with  differential  operators  that  may  only  be  defined  on  a subspace 
of  a Banach  space  [21]. 

For  E = Y = R",  a bifurcation  theorem  has  been  obtained  recently  by 
Kopell  and  Howard  [23]  that  proves  to  be  equivalent  to  Theorem  7 for 
£ = R".  (See  also  [24].)  Their  result  is: 
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Theorem  8.  If  Fe  C2((a , b ) x Q,  R")  with  F(A,  0)  = 0 for  Ae  ( a , b ), 
det  Fu(n.  0)  = 0,  and  (d/dk)( det  F„(A,  0))|>=M  ^ 0,  then  the  conclusion  of 
Theorem  7 is  obtained. 

When  its  hypotheses  are  satisfied,  Theorem  7 gives  a fairly  satisfactory 
answer  to  questions  (i)  and  (ii)  raised  above.  It  turns  out  that  within  its 
context,  a partial  answer  can  also  be  given  to  question  (iii).  As  with  ordinary 
differential  equations,  the  stability  of  an  equilibrium  solution  (A,  u)  of 
u,  = F(A,  u ) is  closely  connected  with  the  “spectrum”  of  FU(A,  u).  In  fact,  in 
the  applied  mathematics  literature  if  the  “spectrum”  of  FU(A,  u ) lies  in  an 
appropriate  half-plane  (A,  u)  is  often  called  stable  and  otherwise  unstable. 
Likewise,  the  terms  linearized  stability  and  linearized  instability  are 
commonly  used  to  describe  the  spectrum.  The  following  result  describes 
how  the  zero  eigenvalue  of  Fu(p , 0)  changes  along  the  zero  set  of  F near  (p,  0). 


Theorem  9.  Suppose  £ c Y with  continuous  injection  / and  0 is  an 
/-simple  eigenvalue  of  Fu(p,  0).  If  in  addition  the  hypotheses  of  Theorem  7 
are  satisfied,  then  there  exist  open  intervals  2,  fi  <=  R with  p e a,  0 e //,  and 
functions  ye  C'(a,  R),  pe  R),  x e C'(  a,  £),  we  C‘(/?,  £)  such  that 

(i)  y(A)  is  an  /-simple  eigenvalue  of  FU(A,  0)  and  FU(A,  0)x(A)  = y(A)x(A), 
A e a. 

(ii ) p(s)  is  an  /-simple  eigenvalue  of  Fu(A(.v),  u(s))  and  Fu(A(.v),  u(s))w(s)  = 
p(.s)H-(.s),  s e fi. 

(iii)  y (p)  = 0 = p( 0),  x(p)  = v = w(0),  x(A)  - c e Z,  w(s)  -veZ. 

(iv)  -/(£)  # 0. 

(v)  Near  s = 0,  p(s)  and  — sA'(s)y'(p)  have  the  same  zeros  and  whenever 
A'(s)  # 0,  the  same  sign. 

The  proof  of  Theorem  9 is  an  elementary  consequence  of  the  implicit 
function  theorem,  Theorem  7,  and  Taylor’s  theorem.  Parts  (i)  and  (ii)  tell 
us  how  the  zero  eigenvalue  of  Fu(p,  0)  is  perturbed  along  the  zero  set  of  F 
near  (p,  0).  This  leads  to  (v),  which  permits  us  to  use  qualitative  knowledge 
of  the  bifurcation  picture  to  deduce  information  about  the  spectrum  of  Fu. 
For  example,  for  the  Benard  problem  with  p the  smallest  real  number  at 
which  FU(A,  0)  develops  a null  space,  it  is  fairly  easy  to  show  y'(p)  < 0 and 
A(.s)  > p for  .s  ^ 0 [21,  22].  Hence  p(s ) > 0 for  s ^ 0.  Thus  in  a linearized 
sense  the  convection  solutions  (A(s),  u(.v))  are  stable  since  p(s)  moves  into  the 
half  plane  in  C with  positive  real  part  [and  the  rest  of  the  spectrum  of 
Fu(A(.s),  u(.sj)  remains  there]  while  the  conduction  solutions  (A,  0)  are  stable 
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for  A < //  [/'(/<)  < 0]  and  unstable  for  y > /i.  For  the  Taylor  problem  it  is 
known  experimentally  that  the  same  qualitative  picture  obtains  but  it  has 
not  yet  been  shown  mathematically.  The  difficulty  is  in  establishing  the 
/-simplicity  of  0. 

The  first  result  in  the  direction  of  Theorem  9 we  know  of  is  due  to 
Hopf  [24],  who  essentially  treated  the  case  E = Y = U",  F real  analytic, 
and  A(.s)  = cs  + o(  |.s| ) or  A(s)  = cs2  + o(s2)  with  c # 0.  More  recently, 
Sattinger  [25]  studied  the  case 

(10)  F(A,  u)  = (L  — Afl)u  + //(A,  u)  = 0, 

where  L and  B are  linear  and  L has  a compact  inverse  that  can  be  used 
to  convert  (10)  to  an  equivalent  problem 

(11)  (/  - A7>  + N(X,  u)  = 0 

with  T and  N compact.  Under  additional  technical  hypotheses  [which 
implied  ;'(/i)  < 0 and  A'(s)  ^ 0 for  s ^ 0]  Sattinger  used  a topological  degree 
argument  to  show  p(s)  and  sA'(s)  have  the  same  sign  for  s / 0.  Theorem  9 
is  due  to  Crandall  and  Rabinowitz  [22], 

The  next  step  in  treating  the  stability  question  is  to  show  how  information 
about  the  spectrum  of  £„(/,  u)  with  (a,  ii)  an  equilibrium  solution  of 

(11a)  u,  = F(A,  u) 

can  be  used  to  determine  the  behavior  of  solutions  of  (11a)  for  initial  data 
near  u (for  A fixed  at  A).  We  will  not  discuss  this  point  here  but  refer 
the  reader  to  the  literature  [26,  2,  13,  14],  where  this  question  is  treated, 
often  in  the  context  of  the  Navier-  Stokes  equations. 

We  return  again  to  questions  (i)  and  (ii).  In  addition  to  analytical 
methods,  topological  degree  theoretic  arguments  and  variational  methods 
have  been  successfully  employed  to  study  bifurcation  problems  and  some 
of  the  main  results  obtained  in  this  fashion  will  be  presented  next.  The 
first  such  result  we  know  of  is  due  to  Krasnoselski  [11].  To  state  it,  let 
Y = E.  Le  B(E,  £),  and  /i  a characteristic  value  of  L,  i.e.,  there  exists 
ref,  i^0  such  that  r = fiLv.  We  say  n is  of  multiplicity  k if 
dim  span  % N((l  — pLy)  = k.  Then  Krasnoselski’s  result  is 

Theorem  12.  Let  F(/,  u)  = u — XLu  — H(a , u)  and  ft  be  a neighbor- 
hood of  (/i,  0).  If  L is  linear  and  compact,  H e C(H,  E)  is  compact  with 
//(A,  u)  = o(||m||)  at  u = 0 uniformly  on  bounded  A intervals,  and  n is  a 
characteristic  value  of  L of  odd  multiplicity,  then  (n,  0)  is  a bifurcation 
point  for  F with  respect  to  the  line  of  trivial  solutions. 
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The  proof  of  Theorem  12  uses  Leray-Schauder  degree  and  exploits  in 
particular  the  change  in  the  sign  of  the  Leray-Schauder  index  of  (/,  0)  as  A 
crosses  fi.  The  odd  multiplicity  of  fi  is  essential  as  the  example  of  Eq.  (3) 
shows. 

Actually  as  has  been  shown  by  Rabinowitz  [27],  the  hypotheses  of 
Theorem  12  imply  something  global  rather  than  local  about  the  behavior 
of  the  nontrivial  zeros  of  F.  Let  //'  denote  the  closure  of  the  set  of  nontrivial 
solutions  of  F = 0.  A component  of  Sf  is  a maximal  (with  respect  to 
inclusion)  closed  connected  subset  of//'. 

Theorem  13.  Under  the  hypotheses  of  Theorem  12,  //  contains  a 
component  % that  contains  (/i,  0)  and  either  intersects  r?ft  or  contains  ( fi , 0), 
where  fi  ^ fi  and  fi  is  a characteristic  value  of  L. 

Corollary  14.  If  ft  = IR  x £,  the  first  alternative  of  Theorem  13 
becomes  % is  unbounded. 

The  proofs  of  Theorem  13  and  its  corollary  involve  the  same  machinery 
as  Theorem  12.  See  also  Turner  [28]  and  Ize  in  [15]  for  other  proofs  and 
extensions.  Simple  examples  show  either  alternative  of  the  theorem  is 
possible.  Several  generalizations  of  Theorem  13  have  been  made  and  we 
will  mention  a few.  The  compactness  requirements  on  L and  H can  be 
dropped  at  the  expense  of  a local  result.  For  example  if  L e B(E,  E ),  n is  an 
isolated  characteristic  value  of  L of  odd  multiplicity,  I - fiL  has  Fredholm 
index  0,  and  H e C‘(ft,  £),  then  the  Lyapunov-Schmidt  procedure  reduces 
the  solution  of  F = 0 near  (/*,  0)  to  a local  finite-dimensional  problem, 
so  that  Brouwer  degree  can  be  used  to  conclude  bifurcation  for  this  case. 
More  general  dependence  of  the  linear  part  of  F on  A can  be  permitted. 
The  only  property  one  really  needs  is  that  (A,  0)  have  different  index  on  each 
side  of  (fi,  0).  Likewise  for  global  results,  compact  operators  can  be  replaced 
by  any  more  general  class  for  which  one  has  an  appropriate  notion  of 
index,  e.g.,  fc-set  contractions  (see,  e.g.,  Stuart  [29]).  Finally,  if  n is  a simple 
characteristic  value  of  L,  then  % consists  of  two  subcontinua  that  near  (/*,  0) 
meet  only  at  (n,  0)  [27].  Moreover,  within  the  context  of  Corollary  14 
Dancer  [30]  has  shown  each  of  these  subcontinua  has  a global  extension. 
An  earlier  result  in  this  direction  in  [27]  is  not  correct  as  stated. 

Degree  theoretic  methods  have  also  been  successfully  applied  to  bifurca- 
tion situations  involving  positive  operators.  The  additional  positivity 
structure  permits  the  elimination  of  the  odd  multiplicity  requirements  of 
Corollary  14.  The  analog  of  this  corollary  for  the  positive  operator  case  is 
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Theorem  15.  Let  AC  be  a cone  in  £,  L:  K -*  K be  compact  and 
linear,  and  H e C([R+  x K,  K)  with  //(A,  u)  = o(||u||)  at  u = 0 uniformly  on 
bounded  X intervals.  If  L has  positive  spectral  radius  and  p is  the  smallest 
positive  characteristic  value  of  L,  then  F(X,  u)  = u — X(Lu  + H(X,  u))  has  a 
component  of  nontrivial  zeros  that  meets  (p,  0)  and  that  is  unbounded 
in  IR  x AC. 

The  theorem  was  obtained  independently  by  Dancer  [31]  and  Turner  [32]. 
See  also  Nussbaum  [33]  for  an  interesting  result  in  this  direction. 

One  further  case  in  which  degree  theoretic  arguments  can  be  applied 
very  effectively  is  when  E = Y is  a complex  Banach  space  and  H is  complex 
analytic.  Then  Schwartz  [34]  has  shown  bifurcation  occurs  from  all  character- 
istic values  of  L.  Unfortunately,  the  complex  analytic  case  does  not  seem  to 
arise  in  physical  applications.  For  further  work  on  the  complex  case  where 
analyticity  is  not  necessarily  present,  see  Bohme  [35]  and  Ize  [36],  See  also 
Dancer  [41]  for  local  and  global  results  for  the  real  analytic  case. 

Next  we  study  the  class  of  problems  arising  when  F is  a potential 
operator  and  variational  methods  can  be  applied  to  the  bifurcation  question. 
Many  problems  in  elasticity  theory,  in  particular  the  example  of  Section  III, 
fall  into  this  category.  To  define  terms  more  precisely,  suppose  £ is  a real 
Hilbert  space,  U is  a neighborhood  ofO  in  £,  and / e Cl(U , IR).  Then  for  all 
u e U,f'(u),  the  Frechet  derivative  of / at  u,  is  a linear  map  from  £ to  IR, 
i.e.,  f'(u)  e £'.  Since  £ is  self-dual,  f'(u)  can  (and  will)  be  identified  with 
an  element  of  £,  i.e.,/':  U ->  E.  We  call  /'  a potential  operator.  The  first 
result  concerning  potential  operators  is  due  to  Krasnoselski  [11]. 

Theorem  16.  Suppose  feCl(U,  IR)  is  weakly  continuous  and  uni- 
formly differentiable  near  0 with  f'{u)  = Lu  + H(u),  L being  linear  (and 
compact)  and  H(u)  = o(||u|| ) at  u = 0.  If  p is  an  eigenvalue  of  L,  then  (p,  0) 
is  a bifurcation  point  for  F(X,  u)  = f'(u)  - Xu.  Moreover,  for  all  r > 0 and 
sufficiently  small,  F has  a zero  (A(r),  u(r))  with  ||u(r)||  = r and  X(r)~*  p as 
r ->  0. 

The  proof  of  Theorem  16  involves  variational  minimax  arguments  that 
are  in  part  topological.  A slightly  weaker  theorem  [/  e C2(Q,  IR)]  was  proved 
by  Marino  and  Prodi  [37]  using  Morse  theory.  More  recently,  Bohme  [35] 
and  Marino  [38]  have,  aside  from  their  smoothness  assumptions,  obtained  a 
stronger  result  than  Theorem  16. 

Theorem  17.  Let  / e C2(Q,  IR)  with  f'(u)  = Lu  + H(u),  L e B(E,  £), 
and  H(u)  = o(||u||)  at  u = 0.  Then  every  isolated  eigenvalue  of  L of  finite 
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multiplicity  is  a bifurcation  point  for  /-'(A,  u)  = f'(u ) — Au.  Moreover,  for  all 
r > 0 and  sufficiently  small,  F(A,  u ) possesses  at  least  two  distinct  zeros 
(A(r),  u(r))  satisfying  jju(r)||  = r and  A(r)  -»  n as  r -*  0. 

For  /even,  a much  stronger  result  obtains: 


Corollary  18.  If  / is  even  and  dim  N(L  — nl)  — n,  then  for  all  r 
sufficiently  small,  F possesses  at  least  n distinct  pairs  of  solutions  as  in 
Theorem  17. 

Actually  Bohme  and  Marino  permit  a slightly  more  general  term  than 
the  u term  in  F.  For  the  proof  of  Theorem  17,  they  essentially  reduce  the 
problem  via  the  method  of  Lyapunov-Schmidt  to  considering  / restricted 
to  a finite-dimensional  spherelike  submanifold  M of  {||u||  = r}.  The  maximum 
and  minimum  of J on  M then  provide  zeros  of  F,  A appearing  as  a Lagrange 
multiplier.  A similar  theorem  permitting  a more  general  dependence  of  F on 
A but  only  asserting  the  existence  of  one  solution  having  ||u||  = r has  been 
obtained  by  Berger  in  [39]  using  Morse  theory.  Fucik  el  al.  [40]  also  study 
generalizations  of  Theorem  17  in  another  direction. 

It  is  natural  to  inquire  whether  the  variational  structure  implies  any 
connectivity  properties  for  the  zero  set  of  F near  (//,  0)  as  was  the  case  when 
H had  odd  multiplicity.  Bohme  [35]  has  constructed  an  example  with 
E = [R2  and  fe  where  the  component  of  ff  to  which  (/i,  0)  belongs 
consists  only  of  (/*,  0)  so  connectivity  cannot  be  expected  in  general.  How- 
ever if  /is  real  analytic  a pair  of  curves  of  zeros  of  F bifurcate  from  (n,  0) 
[35,  41],  Recently,  in  connection  with  a buckling  problem  McLeod  and 
Turner  [42]  have  extended  Theorem  17  to  the  case  where  fe  C‘(£,  U)  and 
H(u)  has  a small  Lipschitz  constant  tending  to  0 as  ||u||  ->0. 

Another  approach  has  been  taken  towards  the  variational  bifurcation 
question  by  studying  the  number  of  zeros  of  F(A,  u ) near  (/u  0)  as  a function 
of  A rather  than  of  ||u||  as  in  Theorems  16  and  17.  In  particular,  Clark  [43] 
treated  the  case  where  fe  C‘(Q,  IR)  is  even,  L is  compact,  H = T + V,  where 
T is  homogeneous  of  degree  k > 1,  V(u)  = o(||u||*)  at  u = 0.  Under  further 
technical  conditions  on  V on  N(L  — ^ il ),  he  gave  lower  bounds  for  the 
number  of  zeros  of  F for  A > n (resp.  A < n)  in  terms  of  a topological 
measure  of  the  size  of  the  set  of  u e N{L  - nl)  such  that  ||u||  = 1 and 
(Tu,  u)  < 0 [resp.  (Tu,  u)  > 0].  The  method  of  proof  here  involves  a Galerkin 
argument  in  conjunction  with  minimax  techniques.  Quite  recently  another 
result  in  this  direction  has  been  obtained  by  Rabinowitz  [44]: 
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Theorem  19.  Under  the  hypotheses  of  Theorem  17,  either 

(i)  (/i,  0)  is  not  an  isolated  zero  of  F(n,  •); 

(ii)  There  exists  a one-sided  neighborhood  A of  n such  that  for  all 
X € A,  F possesses  at  least  two  distinct  nontrivial  zeros  (/,  u(X))  with  u(X)  -» 0 
as  /-♦//;  or 

(iii)  There  exists  a neighborhood  W of  n such  that  for  all  X e W - 
F possesses  at  least  one  nontrivial  zero  (X,  u(X))  with  u(X)  -> 0 as  X -» fj. 

The  proof  of  Theorem  19  involves  variational  minimax  arguments. 
Conditions  can  be  given  for  H,  or  more  precisely  its  primitive  h,  whereby 
the  behavior  of  h near  u = 0 determines  which  of  the  alternatives  of 
Theorem  19  occurs.  There  is  also  an  analog  of  Corollary  18  for  this  case. 

While  Theorems  17  and  19  complement  each  other  to  some  extent,  it 
seems  likely  that  a more  general  result  encompassing  both  of  them  is  still 
to  be  found.  It  is  interesting  to  observe  that  the  negation  of  alternative  (i) 
of  Theorem  19  is  essentially  a transversality  statement,  while  transversality 
is  essentially  built  into  the  framework  of  the  proof  of  Theorem  17,  so  perhaps 
this  is  the  notion  required  to  unite  the  two  results. 

Sather  [9],  Kirchgassner  [45],  and  Ize  [36]  have  also  studied  bifurcation 
problems  for  F(X , u)  = Lu  + H(u)  — Xu,  where  H consists  of  a homogeneous 
operator  plus  a term  of  higher  order  at  the  origin,  which  is  not  a potential 
operator. 

We  conclude  this  survey  with  some  results  involving  the  bifurcation  of 
equilibrium  solutions  of  u,  = F(X,  u)  to  time-periodic  solutions  and  bifurca- 
tion of  time-periodic  to  quasi-periodic  solutions.  The  first  result  we  know 
of  involving  the  former  is  due  to  Hopf  [24]: 

Theorem  20.  Suppose  F(X,  u)  is  real  analytic  near  (n,  0)  e IR  x IR" 
with  values  in  IR"  and  F(X,  0)  = 0 for  X near  fi.  Suppose  further  Fu(n,  0)  has 
exactly  one  pair  of  purely  imaginary  eigenvalues  a0,  a0  ^ 0,  which  are 
simple,  and  if  a(A)  is  the  corresponding  eigenvalue  of  FU(X,  0),  Re  oc'(0)  / 0. 
Then  there  exist  real  analytic  functions  (X(s),  u(t,  .s))  and  T(.s)  for  s near  0 
such  that  (A(0),u(f,  0))  = (0,  0);u(f,  s)  £ 0,  T(0)  = 27r/|a(0)|,and  (A(.s),  u(t,  s)) 
satisfy  (11a)  with  period  r(.s). 

The  proof  of  the  theorem  involves  setting  up  a Poincare  map  and  using 
the  implicit  function  theorem.  Hopf  also  studied  the  stability  of  the  periodic 
solutions  in  [24],  Hopf’s  work  has  been  carried  over  to  infinite-dimensional 
problems,  in  particular  in  the  context  of  the  Navier-Stokes  equation,  by 
ludovich  [46],  Sattinger  [13],  Joseph  and  Sattinger  [47],  and  Iooss  [14], 
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Unfortunately,  although  the  abstract  machinery  is  now  available  and 
examples  are  known  such  as  the  wavy  vortices  for  the  Taylor  problem, 
which  seem  to  be  of  this  type,  to  the  best  of  our  knowledge,  nobody 
has  succeeded  in  finding  a fluid  dynamical  example  where  the  hypotheses 
of  the  general  theorems  can  be  verified. 

The  Hopf  bifurcation  result  has  been  carried  one  step  further  by  Sacker 
[48],  (See  also  Ruelle  and  Takens  [49]  and  Lanford  [50].)  He  assumes 
that  (11a)  possesses  a one-parameter  family  of  solutions  (/,  u(t,  /))  for  / near 
H with  u 2^-periodic  in  t.  (Again  u is  an  n- vector.)  Suppose  further  that 
n — 1 Floquet  multipliers  have  magnitude  less  than  ! for  A < p and  that 
exactly  one  pair  of  these,  p(/)  = e2nr{,)  and  p(A),  leave  the  unit  circle  at 
/ = p,  i.e.,  Re  ot(p)  = 0,  Re  y.'(p)  > 0.  Then  if  p(p)q  ^ 1,  q = 2,  3,  4,  F is 
sufficiently  smooth  (C1 2 3 4 5 6 7 8 9),  and  another  technical  condition  is  satisfied, 
(A,  u(t , /))  bifurcates  into  an  asymptotically  stable  torus  as  / increases 
through  p. 

Sacker  gives  two  proofs  of  his  result : the  first  involves  setting  up  a Poincare 
map  to  find  an  invariant  curve  and  the  second  requires  the  solution  of  a 
first-order  quasi-linear  partial  differential  equation  that  is  carried  out  using 
an  elliptic  regularization  argument.  The  analog  for  the  Navier-Stokes 
equations  of  Sacker’s  result  has  been  carried  out  recently  by  looss  [51] 
using  [49]  and  [50]. 
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1.  Introduction 

The  considered  generalized  differential  system  is 

A[y,  r](f)  = -dz(t)  + [C(t)My(f)  - D(t)z(t)]  dt  + [dM(t)]uy(t)  = 0, 

L[y,  z](l ) = + A0 (t)y(t)  - B(t)z(t)  = 0,  uy(t)  = A2(i)y(t ), 

whereon  a given  real  interval  [u,  b]  the  matrix  functions  A0.  A ,,  A2 , B.  C, 
I).  and  M are.  respectively,  of  dimensions  n x n,  n x n,  n x n,  n x m,  m x n, 
m x m,  and  m x n,  with  /!,,  /12  nonsingular.  The  notation  employed  is 
consistent  with  that  of  references  [6  8]  of  the  author.  It  is  assumed  that 
A0,  A ,,  A2,  A , \ /12  B,  C,  and  D are  of  class  ii ' on  [«,  /?],  while  M e . 
For  T = { v : ye  y = A2  'uy  with  uye  a solution  of  (S)  is  a pair 
( V,  z)  e T x '-ff  such  that  L[y,  z]  = 0 a.e.,  while  A[y,  z]  - 0 in  the  sense  of 
the  Riemann  Stieltjes  integral  equation 

, / , r 

z[l)  = z( r)  + | [Cuy  - Dz ] ds  + | [dM]uy  = 0.  (f,  r)  e [u,  h]  x [u,  b], 

• r * r 

(1) 

For  a system  of  ordinary  differential  equations  to  which  (S)  reduces  when 
M is  constant,  some  of  the  merits  of  introducing  matrix  coefficients  At, 
A2  are  discussed  in  [8,  pp.  109,  110;  also  Problem  8,  p.  116].  For  (S)  in 
case  At  = A2  = £„,  see  [5,  6];  in  case  A 2 = En , see  [7],  The  real  scalar 
generalized  second-order  differential  equations  treated  by  Sz-Nagy  [9],  Feller 
[1],  and  Kac  and  Krein  [2]  are  particular  cases  of  (S).  The  comprehensive 
significance  of  systems  (S)  is  herein  illustrated  by  applications  to  systems 
with  interface  conditions,  to  integrodifferential  systems  with  integral 
boundary  cc  nditions,  and  to  systems  of  difference  equations. 
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For  brevity,  an  (r  + h)  x (s  + k)  matrix  M is  written  (M",  M12;  M2\ 
M22)  in  terms  of  the  component  matrices 

M12  = [M,.„J,  = M22  = [Mr+i ,+ J. 

where  x = 1,  r,  /}  = 1,  s,  i = 1,  ....  h,j  = 1 , k.  Correspondingly, 
an  (r  + /i)-dimensional  vector  r is  written  (i1 ; v2),  where  t'1  = (t,),  v2  — (ir+l). 


2.  Basic  Properties 

The  symbol  £ will  denote  the  class  of  y e £'  for  which  there  is  a z e Si” 
such  that  L[y,  z](f)  = 0 a.e.  on  [a,  b ].  Also,  we  set 

£°  = 1 y ■ y g Uy{a)  = 0 = uy(b)}. 

Correspondingly, 

= {C:  C e Si„x,  C = A*~  \ with  € »93„}, 
and  a solution  of  the  system 

Ml.  CKO  s + [fltWO  + /ir ‘(lMo*(l)C(<)]  <* 

+ = o,  <s.) 

Ml.  C](<)  = »'(<)  - o*(0i(0  - b*(0C(0  = 0, 

is  a pair  (i/,  £)  e S-Hm  x such  that  C]  = 0 a.e.  on  [a,  h],  and 

£]  = 0 in  the  sense  of  a Riemann-Stieltjes  integral  equation  related 
to  A*  as  (1)  is  related  to  A.  Correspondingly,  T*  signifies  the  class  of 
r]  e 9lm  for  which  there  is  a £ e Si„x  such  that  L^rj,  £]  = 0 a.e.  and 

= {n-  n e $* . = 0 = »/(*)}• 

One  may  readily  establish  results  corresponding  to  Theorems  2. 1-2.3  of 
[6]  and  Lemmas  2.2  and  2.3  of  [7],  relating  (1)  to  the  functional 

. b _h 

J[y,  z;tj,C]=  | [C*Bz  + tj*Cuy ] dt  + ) r\*[dM]uy. 

‘ a 'a 

In  particular,  as  an  extension  of  Theorem  2.3  of  [6],  we  have  that  if 
(/,  ij/)  e Si„  x ©23m  then  (y,  z)  is  a solution  of 

A[y,  z]  = #,  L[y,  z]  =/,  (S') 

iff  (y,  z)  = (y,  z — Muy)e  T x and  is  a solution  of  the  system 

A[y,  z]  = # + M/1  f Y dt,  L[y,  z]  =/,  (S') 
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where  A[y.  r]  = -dz  + [CA 2 y - Dz]  dt,  L[y,  z]  = Ax[A2  y]  + A0y  - Bz. 
while  Ax  = Ax,  A2  = A 2 . B = B.  A0  = A0  - BMA2 , D = D + .V/.4  2 ‘B.  and 
C = C — DM  +-  .V/.-4 1 1 /4 q /1 2 1 - -V/.4  f !B.V/.  In  particular,  if  *//  is  absolutely 
continuous  then  (S')  is  an  ordinary  differential  system.  For  p = ( uy : z)  and 
q = (17:  1),  let  rp  = (p(a):  p(h))  and  cq  = (q{a):  q(b)).  If  P is  an  r x 2(n  + m ) 
matrix  of  rank  r.  then  corresponding  to  the  classical  solvability  theorem 
for  ordinary  differential  equations[8.  Theorem  III.6. 2],  for  ( /. ip)  € x 
the  system 

A[>,  z](t)  = #(r),  L[y,z](t)=f(t),  P(cp)  = 0 (SB) 

has  a solution  iff  (»/*  dip  +-  £*/  dr)  = 0 for  all  solutions  (^7,  2)  of  the 
homogeneous  adjoint  problem 

A.[*  CKO  = 0.  L.[f/.  :](f)  = 0.  0(c*)  = 0.  (S?) 

w here  0 is  a [2(«  ■+■  m)  - r\  x 2 (n  4-  m)  matrix  of  rank  2 (n  + m)  - r satisfying 
PK*Q * = 0,  with  K = (-  J.  0:  0.  J)  and  J the  (n  + m ) x (n  + nt)  matrix 
(0.  0) 

3.  Applications 

Three  types  of  applications  will  be  presented 
A.  SYSTEMS  Will!  IN TLRFACE  CONDITIONS 

Consider  a system 

A[>,  z]=  -dz  + [Cy  - Dz]  dt  = dip,  L\y,  z]  = Ax°y'  + A°y  - tf'z  = 0. 

(S°) 

with  coefficient  matrix  functions  satisfying  conditions  analogous  to  those 
specified  in  the  introduction,  ip  e 'B sBm , and  interface  values  u,  (a  = 1 , a) 
satisfying  a = a„  < ux  < • • • < a„  < an  + , = b,  with  the  condition  that  (y,  z ) 
is  a bounded  solution  of  (S°)  on  each  subinterval  (afi.  ap),  fi=  1, . . .,  a + 1, 
having  unilateral  limits  at  each  division  point,  and  subject  to  end  and 
interface  conditions  P°(cp)  = 0,  y(a7* ) = H7yia7  ) (a  = 1,  ....  n)  where  P° 

is  an  r x 2 (n  -f  m)  matrix  of  rank  r,  and  //,(«  = 1 n)  is  a nonsingular 

n x n matrix.  If  u = uy  is  defined  as 

u(t)  = y(t)  on  ( an,b ), 

u(t)  = H„  - H7y(t)  on  (a  = 1 rr). 
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then  for 

A2(t)  = E on  (aa,b), 

A2(t)  =Ha-  H1  on  (a  = 1 a), 

a ♦ 1 

A^i)  = AiU{t)A2  '(/)  on  (J  (cip-i,  a„). 

v = \ 

the  system  (S"H)  consisting  of  (S,J).  P°(rp)  = 0.  and  interface  conditions,  is 
equivalent  to  a system  (SH)  with  M = 0.  P = P°(R.  0:  0.  En,m).  and  R the 
(n  + m)  x (»  + m)  matrix  (H,  1 H„  \ 0;  0.  Em ). 

H.  BOUNDARY  PROBLEMS  INVOLVING  INTEGRODIH  LRLNTIAL  SYSTEMS 
Many  considered  boundary  problems  may  be  written  in  the  form 

b . b 

x'(t)  + K(t)x(t)  + Q(f)  | [</T(s)]v(.s)  = 0(f).  ) [</0(s)].x(s)  = 0.  (ID) 

* a u 

where  the  vector  functions  x and  0 are  /i-dimensional.  K and  il  are. 
respectively,  k x k and  k x r matrix  functions  of  class  if*,  while  T e 
and  0etB '!<,*.  Let  5(f)  = 0 for  f e [u,  h).  S{h)  = -£r.  and  define  the 
(k  + r)  x (k  + r)  matrix  function  /0(f)  and  the  (/  + r)  x ( A.  + r)  matrix 
function  //(f)  as 

V0(f ) = (K(f),  O(f);  0,  0).  //(f)  = (0(f).  0;  T(f).  5(f)). 

Equation  (ID)  is  satisfied  by  v iff  y and  r are  vector  functions  of  respective 

dimensions  k + r and  / + r with  x — (yj  (a  = 1 k)  that  satisfy  with 

/ = (0;  0)  the  system 

-dz  + [</.//]>-  = 0,  >>'  + .c/0  y = J,  r(u)  = 0,  -'(/>)  = 0,  (2) 

which  is  of  the  form  (SH)  with  n = k + r,  m = l + r.  A , = A2  z E„ . H.  C.  and 
D zero  matrices,  while  /0  and  //  are  as  defined  above  Moreover,  in  terms 
of  t\  = (f/j ; f/2)  e \»lj  x \Mr  and  2 = (Cj : C2) e x , the  system  adjoint 
to  (2)  is 

t)  = 0,  -dc  + .</„*£  + [d.//*]n  = 0,  c(u)  = 0.  ;(/>)  = o.  (3) 

Then  r\  = t],  £ = C — • ^*1  arc  solutions  of  a system  of  ordinary  differential 
equations,  and  hence  absolutely  continuous.  In  terms  of  t\  = (r/, ; f/2).  where 
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i/,  and  tj2  are  constant  vectors  of  respective  dimensions  / and  r,  and 
C = (s,;C2)e'il*  X \U,withC,  = Cl  - 0*>7i  - V'th’Cz  = C2  ~ S*(f)»/2,  this 
system  may  be  written 

- + K'UKi  + ^’-(0^(0  = 0,  -CV  + n*(')Ci  + n*(r)G?(r)  = 0, 

Ci(fl)  + G(a ) = 0,  C2(u)  = 0,  Ci(fc)  + G(b ) = 0,  C2(/>)  - rj2  = 0,(4) 

where  0(f)  = 0*(r)>h  + T*(t)i/2.  For  the  case  of 

_ h ,h  ,h 

| [t/H'Jx  = C.x(a)  + Dx(/>),  | [</@]x  = /4x(a)  + Bx(b)  + j Hx  ds, 

* a * a % a 

(4)  reduces  to  the  system  adjoint  to  (ID)  as  determined  by  Krall  [3].  For 
differential  and  integrodifferential  equations  involving  integral  boundary 
conditions  the  recent  literature  has  been  rather  extensive,  but  for  brevity 
reference  is  limited  to  Krall  [3],  which  contains  a fairly  extensive  bibliography. 
For  (ID),  the  above  formulation  is  a unifying  one,  whose  simplification  is 
due  largely  to  the  fact  that  the  given  /(-dimensional  x-problem  is  extended 
to  a (A:  + / + 2r)-dimensional  problem,  wherein  the  boundary  conditions 
are  satisfied  by  all  absolutely  continuous  vector  functions  (y;  z)  that  vanish 
at  a and  b.  It  is  to  be  remarked  that  for  a special  case  of  a self-adjoint 
system  (ID)  the  reduction  to  an  equivalent  ordinary  differential  equation 
problem  was  employed  by  Reid  [4], 


C.  RELATED  DIFFERENTIAL  AND  DIFFERENCE  SYSTEMS 

Let  m = n,  A , - A2  = B = En,  and  A0,  C,  and  D be  zero  matrices,  while 
a = f0  < fj  < + i = b is  a partition  of  [a,  b],  and  M is  constant  on 

each  open  subinterval  (L_,,  t„)  (a  = I,  ...,  p + 1).  Then  for 

Ho  = M(a  + ) — M(a),  pt  = M(f/)  - M(ta~)  (a  = 1 p), 

/Vi  ~ M(h)  — M(b~), 

(y;  z)  is  a solution  of  (S)  iff  y is^the  polygonal  function  whose  graph  joins 
the  successive  points  (tj,  >(*,))  (j  = 0 , p + 1),  and  the  values  y(tj)  satisfy 

the  linear  second-order  difference  equation 

- >(U]  - (i/ViW-  y(L-i)]  - p,yU,)  = o 

(«  = i p). 
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where  6„  = tp+ , - i„  (P  = 0,  1 p),  while 

z(0  = /(t)  = (l/6fl)[y(tft+i)  ~ y(tp)\  on  + (i  = 0,  1 p, 

z(a)  = (1/<50)[>’('i)  - >’(f0)]  - Ho  y(u), 

z{lp  + i)  = ( + 1)  ~ _v(f/j)]  + [M{tp+ 1)  — M(tp)]y(tp+  j), 

P = 0,  1 p. 

The  intimate  relationship  between  scalar  second-order  linear  homogeneous 
differential  equations  and  systems  of  difference  equations  is  well  known.  In 
the  real  self-adjoint  case  the  existence  of  analogous  comparison  and 
oscillation  properties  of  solutions  dates  from  the  work  of  Sturm.  For  the 
author  the  relationship  between  differential  and  difference  equations 
remained  in  the  domain  of  analogy,  however,  until  his  study  of  generalized 
differential  systems  in  [5],  and  the  realization  that  both  were  special  instances 
of  a general  system,  wherein  for  the  self-adjoint  case  one  has  extensions 
of  the  classical  Sturmian  theory. 
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1.  Introduction 

Let  us  consider  a finite-dimensional  stochastic  system  whose  state  x(t ) = 
(xj(f),  x„(r))  is  a vector-valued  random  variable.  We  suppose  that  x(t) 

evolves  according  to  stochastic  differential  equations,  written  in  vector- 
matrix  notation  as 

dx  =f(x{t))  ilt  + o{x{t))  dw,  (1.1) 

where  w = (w,, . wj  is  a brownian  motion,  and  (1.1)  is  ;nterpreted  in  the 
I to  sense. 

It  is  generally  quite  difficult  to  calculate  the  probability  distribution  of 
x(f).  Actually,  one  often  wants  not  the  distribution  itself,  but  rather  certain 
related  quantities  (for  example,  the  moments).  Suppose  that  /or  a depends 
on  a small  parameter  e,  and  that  the  quantities  of  interest  can  be  found 
exactly  when  i:  = 0.  One  wishes  to  compute  these  quantities  approximately 
for  small  e ^ 0.  This  kind  of  problem  arises  in  many  applications,  and 
various  approximation  schemes  exist,  including  system  linearization, 
perturbation  expansions  in  powers  of  the  parameter  e,  and  truncation 
schemes. 

In  Section  2 we  briefly  review  some  literature  dealing  with  nearly  deter- 
ministic systems.  In  Section  3 we  turn  to  systems  that  we  call  nearly  linear. 
Some  results  are  stated  about  truncating  the  infinite  system  of  differential 
equations  for  moments.  Proofs  of  these  results,  and  extensions  of  them  appear 
in  [20].  In  Section  4 the  same  truncation  scheme  is  applied  to  a one- 
dimensional problem  in  population  genetics  theory,  concerning  the  rate  of 
decay  of  mean  heterozygosity  in  case  of  nearly  neutral  genes.  It  appears  that 
this  technique  may  be  useful  for  more  dimensional  genetics  models.  Finally, 
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in  Section  5 we  mention  some  results  about  optimal  stochastic  control 
problems  that  are  either  nearly  deterministic  or  nearly  of  linear  regulator 
type. 


2.  Nearly  Deterministic  Systems 


Now  suppose  that  (1.1)  takes  the  form 

dx  = f(x(t))  dt  4-  (2 1:)1  2rr(.v(/))  dw,  (2.1 ) 


with  e a small  positive  parameter.  A variety  of  questions  can  be  asked; 
which  of  these  are  well  posed  and  interesting  depends  to  a considerable 
extent  on  the  structure  of  the  unperturbed  dynamical  system  [/;  = 0 in  (2.1)]. 
One  can  always  consider  results  for  fixed  t (finite).  For  small  e,  the  distribu- 
tion of  ,x(t)  will  be  approximately  gaussian  and  centered  about  the  unper- 
turbed system  trajectory.  A "ray”  method  for  finding  the  distribution 
approximately  has  been  given  by  Cohen  and  Lewis  [4],  It  involves  the 
solution  of  a nonlinear  first-order  partial  differential  equation. 

In  many  problems,  it  suffices  to  compute  expectations  £d)[.\'(f)]  for  various 
choices  of  function  d>(.v),  rather  than  to  compute  the  distribution  of  x(t). 
An  example  is  the  moments  of  x(t).  Now  x(t)  is  a Markov  diffusion  process 
with  generator 

df  ~ df°  4-  e.  //, 


J32 


y'°  =/•  v. 


.// 


X a>j  - Y a Y ’ aij  X °ik  Gjk  ■ 

i.7=l  i t X j fc=l 


(2.2) 


Let  us  suppose  that  x(0)  = y,  and  regard  the  desired  expectation  as  a 
function  of  t and  y: 

«Hf,y)  = £<P(x(f)). 


Under  suitable  technical  assumptions  (f>  satisfies  the  backward  equation 

ccfr/ct  = y'(p . 


with  the  initial  data  0(0,  y)  = One  can  seek  an  asymptotic  expansion: 

0 = 0°  + e0l  + c202  + ■■■  + £m0m  4-  0(em  + 1 ).  (2.3) 

The  coefficients  0X,  02 , ...  satisfy  linear  first-order  partial  differential  equa- 
tions, obtained  by  formal  substitution  in  (2.3).  Under  suitable  assumptions, 
the  validity  of  (2.3)  can  be  shown  either  by  methods  of  partial  differential 
equations,  or  by  a probabilistic  method  [6]. 
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Another  quantity  of  interest  is  £<D(x(r)),  where  t is  the  exit  time  of  x(t) 
from  some  bounded  open  set  B.  If  we  denote  this  by  (f)(y ),  then  y'</>  = 0 
in  B and  $ = < ton  the  boundary  PB.  An  expansion  like  (2.3)  holds  in  any 
“regular”  subregion  B'  of  B from  which  characteristic  curves  lead  to  PB 
and  intersect  PB  nontangentially.  See  [6,  Section  5].  Holland  [12]  has  given 
a probabilistic  treatment  of  the  boundary  layer  expansion,  near  points  of  PB 
from  which  the  characteristics  lead  into  a regular  subregion  of  B.  If  all 
characteristics  are  asymptotically  stable  to  a point  inside  B,  then  0(y)  behaves 
entirely  differently  for  small  c.  See  Vent’sel  and  Freidlin  [21]. 

In  a number  of  physical  applications  <t>(x)  can  be  regarded  as  “energy” 
associated  with  state  x,  and  If0®  = 0.  Energy  is  constant  along  the  deter- 
ministic trajectories  (*;  = 0).  Then  Pf®  = i;.i/®\  it  is  reasonable  to  consider 
times  of  order  e~  '.  In  some  problems  (arising  for  instance  in  wave  propaga- 
tion in  random  media)  there  is  an  approximating  diffusion  process  on  a 
new  scale  r = ct,  after  introducing  new  state  variables  and  appropriately 
centering  the  process.  See  Papanicolaou  [18.  Section  2;  19,  Sections  4,  8]. 
In  other  cases,  in  which  /(x)  is  nonlinear,  only  information  about  the  rate 
of  growth  of  mean  energy  is  known.  See  Carrier  [3]  and  Fleming  [6,  Section  3, 
Example  2], 

If  the  process  x(t)  is  ergodic,  one  can  ask  for  an  asymptotic  expansion 
in  powers  of  i:  for  J ®{x)n(dx),  where  g is  the  equilibrium  distribution  (the 
integral  is  over  n-dimensional  space).  Such  expansions  are  obtained  by 
Holland  [10], 

A method  of  perturbing  the  time  scale  in  powers  of  t,  following  Poincare, 
was  applied  by  Amazigo  [1]  to  a problem  of  buckling  for  columns  with 
random  imperfections. 


3.  Nearly  Linear  Systems 


Let  us  now  suppose  that  (1.1)  has  the  form 


dx  = [Ax(t)  + C0(x(f))]  dt  + 


a + X /WO 


dw. 


i = 1 


(3.1) 


and  g(x)  is  a polynomial  of  degree  /.  We  summarize  a truncation  method 
for  the  moments,  which  is  analyzed  in  detail  in  [20].  For  the  validity  of 
the  method,  some  further  conditions  on  g are  needed.  These  conditions 
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imply,  in  particular,  that  solutions  of  (3.1)  do  not  explode  in  finite  time,  and 
that 


(d/dt)EQ>(x(t))  = £{^<D(x(f))}  (3.2) 


for  any  polynomial  <D(x). 

When  c = 0 we  denote  the  solution  by  x°(f): 


dx°  = Ax°(t)  dt  + 


a + 


i=  1 


dw. 


(3.1°) 


We  call  the  system  (3.1°)  linear  [some  authors  say  bilinear,  in  view  of  the 
products  Xj°(t)  dw].  If  each  /I,  =0,  then  x°(f)  is  gaussian.  Its  distribution 
can  be  found  explicitly  by  solving  linear  ordinary  differential  equations 
for  the  means  and  con  variances.  See,  for  instance,  [6,  Section  3,  Example  1], 
If  some  ^ 0,  the  distribution  of  x°(r)  can  no  longer  be  found  exactly. 
However,  a system  of  linear  ordinary  differential  equations  for  the  moments 
of  order  up  to  any  given  m can  still  be  derived.  This  is  seen  by  applying 
(3.2)  to  any  polynomial  <I>,  and  observing  that  is  a polynomial  of 
degree  no  more  than  the  degree  of  <1>. 

For  i:  > 0,  is  a polynomial  of  degree  < m + / — 1 if  <I>  is  a polynomial 
of  degree  <m.  By  applying  (3.2)  to  any  polynomial  of  degree  <m,  we  get 
a linear  system  of  differential  equations  for  the  moments  of  x(t)  of  order 
<m.  This  system  is  not  closed,  since  the  right  side  may  involve  moments 
of  orders  m + 1, . . . , m + / — 1.  However,  an  approximation  for  the  moments 
of  orders  < m that  is  correct  up  to  order  <;*+  1 can  be  obtained  by  truncating 
the  infinite  system  of  differential  equations  for  the  moments.  This  is  done  by 
ignoring  all  moments  of  order  greater  than  some  mk. 

Let  us  describe  the  results  when  x(/)  and  w(f)  are  scalar  processes.  For 
the  vector  case,  only  notational  changes  are  needed  to  describe  the  truncation 
method.  However,  in  that  case  the  conditions  given  in  [20]  to  ensure  its 
validity  are  more  complicated.  Consider  the  moments 

Hj{t)  = E[x(t)]j,  j = 0,1,2,.... 


Assume  that  the  polynomial 

9(x)  = £0,x' 

i = 0 


has  odd  degree  /,  and  that  g,  < 0.  By  (3.2)  with  <I>(x)  = xj,  we  get 

/ 

dnj/dt  = \j(j  - 1 )[a Vj-  2 + 2a figj.  , + /i2g J + jAftj  + r.j  £ gt ,. 

( = 0 


(3.3) 
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For  c = 0,  this  linear  system  of  equations  for  /^,  7 < m,  is  closed.  Given  m 
and  k,  let 

mk  = m + k(l  — 1), 

and  let  nk,  j = 0,  1,  mk  be  the  solution  of  the  linear  system  obtained 
by  omitting  from  (3.3)  all  terms  nr  with  r > mk.  The  initial  data  are 

h>(  0)  = h/(  0)  = yj,  y = x(0). 

Then  it  can  be  shown  that,  for  fixed  T,  there  exists  a constant  C = C(T,  k,  m) 
such  that 

- Hjk(t) | < Cek  + \ 7=1 m,  0 < y < T.  (3.4) 

See  [20,  Theorem  3.1].  Actually,  for  the  proof  of  this  result,  as  well  as  for 
calculations  in  examples,  it  is  more  convenient  to  rewrite  the  truncated 
system  as  a system  of  linear  differential  equations  for  fik,  defined  by 

£/(*)  = /*/(').  y'  = 0,  1, m, 

Mjk(0  = eMjk(0’  j = m+  1 m, 

h/(0  = 7 = mk_  1 + 1 mk . 

Under  stronger  assumptions  the  approximation  of  Hj(t)  by  nk(t)  is  uniform 
for  all  t > 0.  In  fact,  if 

A < 0,  ( mk+l  - \)fi2  <2\A\, 

then  the  constant  C = C(/c,  m)  in  (3.3)  can  be  taken  independent  of  T.  See 
[20,  Corollary  3.1], 

In  formal  truncation  schemes  proposed  for  various  applications,  moments 
of  orders  exceeding  the  truncation  level  mk  are  often  replaced  by  appropriate 
combinations  of  moments  of  orders  <mk,  not  simply  omitted  as  we  have 
done.  See  for  instance,  Amazigo  et  al.  [2,  p.  1352]  and  Kushner  [14],  Such 
modifications  should  improve  the  estimate  (3.4);  but  we  have  not  attempted 
a rigorous  treatment  of  this  idea.  For  another  class  of  truncation  procedures 
in  the  theory  of  turbulence,  see  Lundgren  [15], 

Let  us  conclude  this  section  with  a two-dimensional  example  [20,  p.  42]. 

Example.  Let 

dx t = x2  dt,  dx2  = —(.x,  + bx2  + ex,3)  dt  + <7.x,  dw, 
with  h.  i ;,  a positive,  h,  a fixed,  and  i;  small.  Let 

H.j-iO)  = £[x1(l)ix2(/y'i],  i = 0,  1 j. 
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The  truncation  method  consists  of  omitting  terms  in  the  equations 
corresponding  to  (3.3)  of  order  j > m + 2k.  If  n 2 < b(m  + 2 k — 3)"  ',  then 
the  estimate  corresponding  to  (3.4)  holds  uniformly  for  all  t > 0.  For  instance, 
suppose  that  we  wish  to  estimate  moments  of  order  2 up  to  terms  involving 
f2.  In  this  example,  the  equations  for  even-  and  odd-order  moments  are 
uncoupled.  The  truncated  system  consists  of  15  linear  differential  equations 
for 

Mi.  2 - i Mi,  2 - i ’ Ml,  4 - i *'Mi.  4 - i t Mi,  6 - i ® Mi,  6 - i • 


4.  An  Application  in  Population  Genetics  Theory 

We  consider  a one-dimensional  example  of  a population  with  two  possible 
gene  types  (alleles)  at  a given  locus  on  a chromosome.  This  example  has 
already  been  analyzed  in  detail  by  classical  analysis.  See  Crow  and  Kimura 
[5,  Chap.  8]  and  Miller  [16].  By  a truncation  method  like  that  in  Section  3 
we  can  reproduce  fairly  easily  some  of  the  known  results,  for  the  case  of 
nearly  neutral  genes.  The  truncation  method  could  be  applied  to  multi- 
dimensional models  where  classical  analysis  does  not  seem  available,  as  we 
shall  indicate  at  the  end  of  the  section. 

Let  x(t)  denote  the  frequency  of  one  of  the  two  gene  types,  0 < x(t)  < 1. 
It  is  a one-dimensional  diffusion;  the  generator  takes  the  form 

y,  = x(  1 - x)(d2/dx2)  + ex(l  - x)[h  + (1  - 2h)x]  d/dx, 

provided  time  is  measured  in  units  of  4 N,  where  N is  the  population  size. 
Moreover,  f = 4Ns  where  s and  h are  certain  constants  appearing  in 
coefficients  of  selective  advantage.  The  disappearance  of  a gene  type 
corresponds  to  x = 0 or  x = 1.  There  is  no  mechanism  such  as  recurrent 
mutation  in  the  model  for  reintroducing  an  extinct  type;  the  boundary 
conditions  at  0 and  1 are  absorbing.  We  take  f small  (near  neutrality, 
neutrality  when  e = 0). 

When  f = 0 the  eigenvalues  of  If0  = x(l  — x)  d2/dx2  are  A,0  = — i(i  -I-  1), 

i=  1,2 See  Crow  and  Kimura  [5,  p.  383],  The  quadratic  ^(x)  = .x(l  — x) 

is  an  eigenfunction  corresponding  to  the  dominant  eigenvalue  Aj°  = —2.  Let 


H( I)  = £[*(!)(!  -*(!))]  = £4>,Ml)). 
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Then  2 H(t)  is  an  important  quantity  called  the  mean  heterozygosity.  When 
£ = 0,  we  get  from  (3.2)  and  = -2<D,  the  well-known  formula 

//°(r)  = y(l  -y)e-2\ 

where  y = x(0)  is  the  initial  gene  frequency. 

We  are  interested  in  estimating  Aj  and  H(t)  for  e small.  Once  is 
estimated,  one  can  also  estimate  such  quantities  of  interest  as  the  rate  of 
absorption  of  x(r)  at  0 or  1 and  the  expected  time  to  absorption.  For  this 
purpose  consider  the  polynomials 

^2.-1  = [*0  “ *)]',  1=1,2,..., 

<F2,.(x)  = [x(1  -*)]'■(  I — 2x),  /=  1,2,.... 

Also  let 

Hl(t)  = H(t)  = EQl(x(t)), 

Hj(t)  = j = 2i,  2i  + 1,  / = 1,  2, . . . , 

H* -(A*,. 

By  applying  (3.2)  to  <D  = O,,  i = 1,  ...,  2 k + 1,  we  get  a system  of  linear 
differential  equations  of  the  form 

dHk/dt  = A*H*  + E*+1r*,  (4.1) 

where  A*  is  a (2k  + 1 ) x (2k  + 1 ) matrix  depending  on  e and  each  component 
of  Tk  is  a linear  combination  of  £<t>2k  + 2(x(r))  and  £<J)2*  + 3(x(t)).  When  e = 0, 
Ak  is  an  upper  triangular  matrix,  with  diagonal  elements  Aj0,  A2°,  A3°,  ..., 
/2\+ ,.  Instead  of  (4.1)  consider  the  truncated  system 

dUk/dt  = Afc  (Hfc  (4.2) 

with  H*(0)  = Hk(0)  = H0k.  Then 


Hk{t)  = exp  (A kt)H0k  + ek+ 1 f exp  [Ak(f  - s)]Tk(s)  ds,  (4.3) 

• o 

Hk(f)  = exp  (Akr)H0k.  (4.4) 

From  (4.3)  and  (4.4)  we  see  that  IH)k(f)  - Hlk(t)  is  of  order  ek+1  uniformly 
for  r in  any  finite  interval.  Therefore,  in  the  expansion 

H(t)  = H°(t)  + £/,(f)  + £2/2(r)  + ••■  + Eklk(t)  + 0(£k+1)  (4.5) 


no 
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where  Hl  is  the  first  component  of  the  vector  ffl.  By  repeated  differentiation 
of  (4.2)  with  respect  to  r.,  we  get  linear  differential  equations  for  /,,  Ik. 
The  initial  data  are  /;(0)  = 0.  We  solved  these  equations  to  get 

1,(1)  = H°(l)[c,  - A(1  - 2*)I  + J(1  - 2y)e-“'  -}(l  - 2h)c2e- 

t'i  = «(1  - 2)1)  + j(l  - 2(i)c2,  c2  = y(t-y)~l  y = x(0), 

with  a more  complicated  expression  for  l2(t). 

Estimates  for  the  dominant  eigenvalue  Let  Aj  be  the  dominant 
eigenvalue  for  the  matrix  A*.  We  know  that,  when  e = 0,  A,0  = A,°  = — 2. 
Let  us  show  that,  for  small  c / 0, 

| Aj  — Aj  | < Cek+  1 (4.6) 

for  some  constant  C.  The  proof  will  use  the  fact,  from  eigenfunction  expan- 
sion of  the  probability  density  of  x(t)  [5,  p.  408],  that  exp  ( — klt)E<t>j(x(t))  is 
bounded  for  each  j,  and  exp  ( — A,r)E<I>,(.x(i))  = exp  (-A,r)//(f)  is  bounded 
away  from  0.  Let  Vx  be  an  eigenvector  for  Ak  corresponding  to  the  eigen- 
value A,.  For  small  e,  the  eigenvalues  of  A*  are  simple  with  A,  near  —2. 
We  can  take  a basis  of  eigenvectors  Vx,  V2,  ...,  F2k  + 1 with  K near  K°  = 
(1,  0,  ...,  0).  Let  [F],  denote  the  component  of  a (2k  + l)-dimensionaI 
vector  Y with  respect  to  Vx.  Then 

[exp  (Akf )T],  = exp  (Ajf)[y]j. 

By  taking  components  in  (4.3) 

[ 0-0*(f )] , = exp  (A1l)[H0k]1  + Ek+  1 I exp  [AJf  - .v)][r*(.s)],  ds. 

• o 

When  e = 0,  H0*  = (y(  1 — y),  0,  . . . , 0)  and  thus  = y(  1 — y).  Since  Ak 

does  not  depend  on  the  initial  data  y = x(0),  we  may  take  y = j in  this 
proof.  Then  [Hq*],  > 5 for  small  e.  Since  Tk(.s)  is  a linear  combination  of 
*(*(*))  afld  ^fc+sM5))* 

|[rk(.v)]i  | < c exp  (A,. v)  for  some  c. 


Case  1.  A,  > A,.  For  some  c,  > 0 we  have 

ck+lc 

c,  exp  (A,/)  < [H *(/)],  < exp  (A,!)^*],  + ■ _ ■ 

We  take  t large  and  find  that 

A,  — A,  < cc\  ,e*  + *. 


1 


[exp  (A,f)-exp  (A,t)]. 
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Case  2.  A,  < A,.  For  some  c2  we  have 

/■*  + 1 c 

C2  exp  (A, 0 > g exp  (A,f)  - ' [exp  (A,f)  - exp  (A,/)]. 

/,  — 

We  again  take  l large  and  find  that 

A,  - A,  < 8 cek+ 

These  two  estimates  give  (4.6). 

We  computed  A,  approximately  using  (4.6)  in  two  cases,  namely,  h = { 
and  h < 0.  When  h = \,  g(x)  = 2x(l  — x)  is  quadratic.  The  number  of  linear 
differential  equations  needed  for  accuracy  of  order  i:k+ 1 is  now  k + 1 rather 
than  2k  + 1.  For  k = 2,  one  finds  by  applying  the  implicit  function  theorem 
to  the  3 x 3 matrix  A2  that 

A,  = -2(1  +&;2)  + 0(«4), 

which  agrees  with  Kimura’s  power  series  [5,  p.  398]  for  A,  up  to  fourth- 
order  terms.  The  case  h < 0 is  called  overdominant.  We  estimated  A, 
numerically  using  (4.6),  and  obtained  results  agreeing  well  with  those  of 
Kimuraand  Miller  [16]  [5,  p.  412]  in  the  range  about  0 < i:  < 3,  i.e„  Ns  < 0.75. 

The  above  is  just  a different  treatment  of  a problem  already  thoroughly 
studied  by  classical  analysis.  Let  us  next  turn  to  a two-dimensional  example, 
in  which  classical  analysis  does  not  seem  to  be  an  available  tool.  In  this 
example,  the  population  is  situated  in  two  niches,  with  a certain  rate  m of 
exchange  of  genes  between  niches.  Let  N be  the  number  of  individuals  in 
each  niche,  and 

y.  = 4Nm , i:  = 2 Ns,  h = j. 

Let  x,(i)  = gene  frequency  in  niche,  i — 1,  2. 

The  generator  of  the  two-dimensional  diffusion  (x,(/),  x2(f))  is 

(l2  d2 

y,  = X‘°  + 

, . <’  , , a 

+ a (x, 

0 0 

+ ext(l-xt)^  + x2(l  — *2)  (~ix^ 
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Consider  the  problem  of  finding 

W1(f)  = £[x1(f)(l-.v1(f))] 

H2(t)  = E[(  1 - 2xl(t))(x2(t)  - x,(r))] 

H3(t)  = E[x2(t)(l-x2(t))]. 

2H,(t ),  2H3(t)  are  the  within-niche  heterozygosities  and  H2(t)  + 2 H {(t)  is 
the  between-niche  heterozygosity.  When  e = 0,  the  vector  0-0  = (H^  H2 , H3) 
satisfies 


dU/dt  = AH,  A = 


— 2 a 

4-2  a -4a 
2a  a 

A,  = -(1  + 2a)  + (1  + 4a2)1  2, 


/2  = 


0 
4a 

— (2a  + 2)_ 
-(1  + 2a). 


For  c / 0 one  is  dealing  with  a selection-migration  model  with  finite 
population  sizes  in  each  colony.  Near  neutrality  means  e near  0.  The 
truncation  method  outlined  above  could  be  applied,  but  we  have  not  done 
the  calculation. 

A similar  method  of  linear  differential  equations  (or  difference  equations 
in  discrete  time)  has  been  used  for  other  problems  in  neutral  gene  theory. 
We  refer,  in  particular,  to  the  problem  of  linkage  disequilibrium  for  2-locus 
models;  see  Hill  and  Robertson  [8],  Kimura  and  Ohta  [13,  Chapter  7], 
and  Nei  and  Li  [17],  Again  our  truncation  method  could  be  applied  if 
near  neutrality  holds,  at  the  expense  of  increasing  substantially  the  number 
of  linear  differential  equations  that  must  be  simultaneously  treated. 


5.  Optimal  Stochastic  Control 

Let  us  now  suppose  that  the  stochastic  differential  equation  describing 
the  process  x(f)  depends  on  a control.  For  a nearly  deterministic  system, 
instead  of  (2.1)  let  us  take 

dx  = f(x(t),  u(t))  dt  + (2e)'  2ct(x(i))  dw\  (5.1) 

where  u(t)  is  the  control  applied  at  time  t.  Given  a criterion  of  system 
performance,  the  problem  is  to  calculate  approximately  the  optimal  expected 
performance  and  an  optimal  control.  For  closed-loop  (feedback)  controls 
this  problem  is  treated  in  [7]  and  [9],  and  for  open-loop  controls  in  [1 1], 
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In  [20.  Chapter  3J  a nonlinear  perturbation  ol  the  stochastic  linear 
regulator  is  treated  The  system  equations  are  now 

dx  \Ax(t)  + liu(i)  -f  .,|<y(x(/))jj  dt  + o(t)dw(t)  (5.2) 

with  no  constraints  on  the  control  u(l)  and  with  the  usual  quadratic 
performance  criterion  Under  suitable  assumptions  on  the  uncontrolled 
system  [with  u(r)  0 J.  it  is  shown  that  there  is  an  expansion  of  the  optimal 
feedback  control  in  powers  of  i.  If  x(f)  is  a scalar-valued  process,  these 
assumptions  hold  if  q(x)  q,x'  + cy,  ,x'  1 +•••  + q{) , with  / odd  and  g,  ^ 0. 
I he  method  is  first  to  find  ari  expansion  of  the  corresponding  solution  of 
the  dynamic  programming  equation  in  powers  of  A crucial  estimate 
is  an  a priori  linear  growth  rate  as  |x|  * / for  the  optimal  feedback 
control  as  a function  of  state  x. 
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1.  Introduction 

It  is  well  known  that  Hadamard's  postulates  for  a well-posed  problem 
are  inadequate  for  those  problems  of  mathematical  physics  described  by 
quasilinear  elliptic  equations.  Even  if  a solution  exists  it  is  not  unique  in 
general.  The  stationary  Navier  Stokes  equations,  e.g.,  possess  a continuum 
of  solutions  in  many  physically  interesting  cases.  Among  all  possible 
solutions  nature  selects  according  to  stability  properties  (cf.  [17]). 

The  question  arises  naturally  whether  there  are  constructive  methods 
simulating  the  behavior  of  nature  and  selecting  the  physically  relevant 
solution.  Usually,  analytic  and  numerical  procedures  describe  the  whole  set 
of  solutions.  Those  methods  are  known  for  bifurcation  problems  near  a 
simple  eigenvalue  (cf.  [IX]).  Recently  algorithms  have  been  proposed  on  a 
combinatorial  basis  that  determine  all  fixed  points  of  a mapping  [1], 

We  describe  an  iteration  process  that,  under  certain  assumptions,  possesses 
the  required  selection  property.  It  can  be  considered  a nonlinear  version 
of  the  well-known  Picard  Poincare  Neumann  method.  The  sufficient  part 
of  the  criterion  for  local  convergence  is  trivial,  whereas  the  necessary  part 
is  deeper  and  important  for  the  selection.  We  show  the  applicability  for 
some  semilinear  problems  in  a partially  ordered  Banach  space.  Finally,  it 
is  shown  that  certain  stability  exchange  phenomena  of  bifurcation  theory 
are  simulated  by  this  iteration  process. 

2.  A Selective  Iteration  Procedure 

Let  A"  be  a real  Banach  space  and  L'( X)  the  space  of  bounded  endo- 
morphisms  on  X with  the  uniform  operator-norm.  We  call  LeM(X) 
1 -separable  if  the  spectrum  n(L)  of  the  natural  complexity  of  L decomposes 


* English-language  version  of  an  article  in  " Jahresbericht  der  Deutschen  Mathematiker- 
Vereinigung."  Volume  77.  1975.  appearing  through  the  courtesy  of  If  Ci.  Teubner,  Stuttgart. 
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into  two  disjoint  parts  al  and  a2,  n2  being  nonempty,  and  if 

sup  | A | < inf  j A | = a,  a > 1 

(7,  n2 

holds.  Compact  linear  operators  L having  spectral  radius  spr  (L)  greater 
than  1 are  1 -separable. 

The  theorem  of  Picard-Poincare-Neumann  states  that  L"x  converges  to  0 
for  all  x e X if  spr  L < 1.  Necessary  for  N = X,  where 

N = {x  c X |lim  Fix  = 0} 

is  spr  L < 1 [25]. 

For  a 1 -separable  L let  P (resp.  Q)  denote  the  Dunford-projectors  to 
(7 ! (resp.  o2)  [13,  p.  178]: 

P = - * j (L  - zid)~  1 dz,  Q = id  - P 

“ |z|  = 6 

where  supai  |/|  < h < a holds. 

One  has  X = PX  © QX.  The  closed  subspaces  PX,  QX  are  invariant  with 
respect  to  L.  There  are  constants  a > 1 , p > 0,  and  a norm  x = Px j)  + Qx 
equivalent  to  the  original  norm  in  X such  that 

||L<2*||  > a\\Qx\\,  \\LPx\\  <(a-  p)\\Px\\  (2.1) 

holds.  Subsequently,  this  norm  will  be  used. 

A Frechet-differentiable  mapping  F:  X -*  X is  called  a Fredholm  operator 
if  F'(x)  is  a Fredholm  operator  for  every  x e X F is  called  real  analytic  in 
X if,  for  all  x,  he  X,  there  are  P„(x;  h)e  X , P„(x;  •)  being  a continuous 
operator  homogeneous  of  degree  n , and  Pn(-;  h)  being  continuous  in  X , 
such  that 

F(x  + h)=t  lPn(^h) 

n-0  n- 

is  satisfied,  where  the  series  converges  normally  [11,  p.  769],  and  B denotes 
the  closure,  B the  interior  of  Be  X,  and  Br(x)  = {y  e X \\\y  - x\\  < r 
Let  F:  X -»  X with  F( 0)  = 0 be  continuously  differentiable  in  some  B,(0), 
i.e.,  F'\  Br(Q)  -*  ilfX)  exists  and  is  continuous;  then  F"(x ) converges  to  0 
for  all  x in  a neighborhood  of  0,  if  spr  F'( 0)  < 1 . If,  however,  F'( 0)  is  1 -separ- 
able one  can  construct  examples,  even  for  C'  -mappings,  such  that  the  set 
N = {x  g X | F"(x)  -*  0}  is  a neighborhood  of  0 [30]. 
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Theorem  2.1.  Let  F:  X -»  X be  continuously  differentiable  in  a neigh- 
borhood of  0.  and  let  F'( 0)  be  1-separable.  Furthermore,  let  one  of  the 
following  conditions  be  satisfied: 

(i)  F(Bl,(x))  ^ 0 for  all  x e X,  p > 0. 

(ii)  F is  continuously  differentiable  in  X and  Fredholm,  with 
dim  QX  = x. 

(iii)  F is  real  analytic  and  a Fredholm  operator  with  ind  F'( 0)  = 0 and 
dim  A"  F'(0)nX  < m0  < x for  all  ne  N. 

Then,  the  set  N is  of  first  category. 

All  applications  yet  made  use  condition  (iii).  Some  will  be  describe  in 
the  next  sections.  Only  a sketch  of  the  proof  will  be  given.  For  a more 
detailed  presentation  cf.  [30]. 

Lemma  2.2.  Let  F:  X -*  X be  continuous  and  continuously  differenti- 
able in  a neighborhood  of  0 and  let  F'( 0)  be  1-separable.  Then,  there  exists 
a double  cone  K c:  X and  a ball  Bs  = Bs( 0)  such  that  x,  ye  Bs,  x — ye  K 
implies  for  some  neW, 

F"(x)  > s or  Fn(y)\\  > s 

Proof.  Let  a , />  be  as  in  (2. 1 ).  Choose  q.r]  > 0 such  that  a > ( 1 + q)(  1 + q) 
and  define  K as  follows: 

K = {xe  A'lUPxll  < q\\Qx\\},  K = K - {0} 

Define  a = pq/a , 6 = min  (r/,  a/(l  + </)2),  and  set  L = F'( 0).  There  is  a ball 
Bs  where  L - F'(y)\\  < 6 holds.  For  xe  K one  obtains  immediately 

ll^x  — PF'(y)x\\  < <5(1  4-  q)\\QLx\\ 

\\QLx  - QF'(y)x\\  < 6(\  + q)\\QLx\\ 

We  show  first: 


(1) 

LK  c 

- K, 

(2) 

l|Lv|| 

> (1  + fj)||x|j  for  x e K,  and 

(3) 

for  arbitrary  y,.  ....  yn e Bs  and  xe  K, 

v = £ *j  F'(yi)x  e 1 v\\  > ( 1 + rj 

J*  i 

holds 

if  Xj  > 

0 and  Yj=  i Xj  = l. 
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In  view  of  (2.1)  and  x e K one  obtains 

\\PLx\\  = ||LPx||  < (u  - p)q  Qx||  <(q~  ff)||QLx|| 

and  hence  (1).  Furthermore,  x e K yields  (2): 

| Lx ||  = ||PLx||  + ||QLx||  > a||ex||  > (1  + q) ||x|| 

For  the  proof  of  (3)  one  uses  (1)  and  (2.2): 

\\Qv\\  > «CLx||  - Qi<*j(F(yj)-L)x  > (1  - 6(1  + q))\\QLx\\ 
j=  i 

||Pi'||  < ||PLx||  + P jr  Xj(F'(yj)  — L).v  < (q  - a + 6(1  + q))\\QLx\\ 

J=  i 

Since  QLx  # 0.  one  obtains  Pc  < q @r||.  Furthermore.  (2)  yields: 

||f||  > || Lx ||  - £ Xj(F'(yj)  - L)x  ||  > (1  + q - S)  |x|| 

j=  i 

For  x,  y e Bs,  x - y e K,  k e [0,  1],  one  has  x 4-  /.(y  ~ x)  e Bs  and 

F(y ) - F(x ) = | F'(x  + /.(y  - x))(y  - x)  d).  e K 
• o 

In  view  of  the  closedness  of  K,  the  last  relation  follows  already,  if  the 
corresponding  Riemann  sums  are  in  K.  This,  however,  is  a consequence  of 
(3).  The  same  argument,  together  with  (3),  yields  the  inequality 

IIF(y)  - F(x)l  > (1  + q - d)||y  - x|| 

hence  P(y)  — F(x)  e K.  It  follows  inductively  that  P"(y ) - Fn(x)  e K as  long 
as  P*(x).  Fk{y)e  Bs  for  1 < k < n - 1.  On  the  other  hand, 

F"(y)  - F"(x)||  > (1  +q-6)n  |y  - x|| 
shows  that  Fk(y).  Fk(x)  e Bs  cannot  hold  for  all  k e N. 

Proof  of  Theorem  2.1.  Let  Bs  be  the  ball  of  the  preceding  lemma  and 
define 

An  = {x  e X | Fk(x)  < s for  k > u| 

The  sets  An  are  closed,  and  we  have 

N c |J  \ 

ne  K 
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It  remains  to  show  that  A„  is  nowhere  dense.  Assume  An  / 0 and 
Bp(x)  c An ; then 

(1)  FkBlt(x)  c:  Bs  for  k > n,  and 

(2)  (Fk)'{y)X  c X - X for  k > n,  y e Bp(x). 

Otherwise  there  exist  y e fi^(x),  z e A\  and  k > n with  0 ^ ( Fk)'{y)z  e K. 
Hence  there  is  a k e R such  that  y + ).z  e Bp(x)  and 

0 # Fk(y  + kz)  - Fk(y)  e K n B, 

which  contradicts  Lemma  2.2  by  (1).  This  proves  (2). 

(i)  The  interior  of  Fn(Bjx))  is  nonempty.  Let 

Br(y)  c=  F"(Bp(x))  and  y e F"(fl,,(x)) 

Since  K / 0 there  exists  a u / y with 

m e (y  + K)  n Br(y)  o Fn(Blt(x)) 

such  that  ||F*(y)||  < s and  ||F*(u)||  < s for  all  k e N and  u - ye  K leads  to 
a contradiction  by  Lemma  2.2. 

(ii)  Let  (e,,  ...,  em)czQX  be  linearly  independent  and  [ej  be  the 
corresponding  elements  of  X/(Fk)'(y)X,  ye  Bp(x).  By  (2)  one  obtains  from 

m m 

X = [0],  z=Z*jeje(X  -K)nQX 

7=1  7=1 

Hence  z = 0 and  all  Xj  = 0.  Therefore,  we  have 

dim  X/(Fk)'(y) X > dim  QX  = x 

contradicting  the  Fredholm  property. 

(iii)  We  show  by  induction 

(3)  dim  X/(Fmn)‘(y)X  > m for  all  ye  X,  me  N 
Setting 

Z(y)  = (Fmn)'(y),  V{y)  = (Flm~  1,n)'(Fn(y)) 

one  obtains  for  ye  flp(x)  by  the  chain  rule,  (1),  (2),  and  the  validity  of  (3) 
for  m — 1 : 

(4)  Z(y)X  e V(y)X  n (X  - R) 

According  to  (3)  of  the  proof  of  Lemma  2.2  one  gets  for  any  e e K : 

(5)  0 * b = V(y)e  e & n V(y)X 
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Let  ([u,]t.,  [«Jt),  cij€  X be  a basis  of  X/V(y)X  and  consider  [a,]. 

[u,]2,  [h],  as  elements  of  X/Z(y)X.  From 

I + P[bl  = [°]« 

j= i 

it  follows  by  (4)  that 

X OLjdj  + fib  e V(y)X  n(X  - K) 

7=1 

and  by  (5) that 

tocjOje  V(y)X 

7=  1 

Hence  a,  = 0.  Therefore. 

Iihe( X - K)n(Kv  {0}) 

implying  //  = 0.  Hence  we  have 

dim  X/Z(y)X  > dim  X/V(y)X  + 1 > m 

The  mapping  Z:  A'-*i!(A')  is  analytic  and  Z(y)  is  Fredholm  with 
ind  (Z(>))  = 0 for  all  ye  X.  Assertion  (3)  for  all  of  X follows  by  a theorem 
of  Gohbergand  Krein  [9,  p.  21]  (concerning  the  necessary  generalization  see 

[30]). 

3.  Positone  Problems 

The  selection  property  of  certain  iteration  schemes  can  be  demonstrated 
effectively  via  Theorem  2. 1 for  so-called  positone  problems.  These  are  roughly 
described  by  equations  leaving  invariant  a partial  order  of  the  given  Banach 
space  (for  the  nomenclature  cf.  [15]).  Because  of  their  practical  importance 
they  have  been  treated  extensively  (cf.  [8]  and  also  [12]).  Existence  and  the 
qualitative  behavior  of  “positive"  solutions  have  been  studied  in  [3,  14.  15], 
whereas  the  multiplicity  question  was  discussed  in  [2,  22,  23]. 

We  are  interested  in  the  convergence  of  a certain  iteration  scheme  and 
the  stability  properties  of  its  limits.  The  iteration  procedure  has  often  been 
used.  Bandle  [4]  was  the  first  to  point  out  that  a sequence  converging  from 
above  to  a nonminimal  solution  cannot  exist,  if  certain  conditions  are 
met.  We  owe  to  Amann  the  hint  that  this  can  be  concluded  already  from 
Theorem  F in  [2]  and  earlier  results  of  Laetsch  [23]. 
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Somewhat  related  to  our  results  are  those  of  Fujita  [7]  and  Sattinger 
[IX.  29],  which  show  the  connection  between  minimality  and  stability. 
Stability  is  understood  with  respect  to  solutions  of  a corresponding  evolution 
equation.  One  can  show  that  the  iteration  procedure  can  be  considered  a 
discrete-time  version  of  this  evolution  equation.  However,  the  discrete  case 
requires  stronger  assumptions.  We  shall  rely  on  Theorem  2.1  (iii)  and  thus 
on  the  analytic  case.  A special  application  follows  at  the  end  of  this  section. 
It  can  be  easily  extended  to  those  boundary  value  problems  whose  Green’s 
function  and  normal  derivative  are  positive. 

Let  (A"0,  .V,.  X 2)  be  a triple  of  real  Banach  spaces  with  X2  Xt.  In 
.V,  (resp.  A"0)  there  are  given  two  cones  Kt  (resp.  K0),  i.e.,  closed,  convex 
sets  with  y. Kj  c Kj  for  a e [R+  and  Kj  n ( — Kj)  - {0}.  We  use  the  notation 
Kj  = Kj  — [0},  Kj  for  the  interior  of  Kj  and  assume  that  k{  is  not  empty. 
In  Xj,  Kj  generates  a partial  order  consistent  with  the  algebraic  and 
topologic  structure  (cf.  [19]).  We  write  u < v if  v — ueKj,  u < v if 
v — ue  Kj,  u v if  v — it  e Kj. 

HI.  Let  A e V(X2,  X0 ) be  an  isomorphism  and  let  A~x  considered  a 
mapping  from  X0  into  Xx  be  compact  and  strongly  positive,  i.e., 
A xk0  c Kj.* 

H2.  Let  F:  Xx  ~*X0  with  F(0)  > 0 be  real  analytic  in  Xl  with  the 
properties: 

u < v implies  F(u)  < F(v),  u <&  v implies  F(u)  < F(v) 

We  look  for  solutions  it  e X 2 of  the  equation 

Ait  = XF(u),  A > 0 (3.1) 

or  equivalently  for  ue  X satisfying 

u = au  + (1  - <x)kA  lF(u)  = T(k,  it)  (3.1) 

for  some  a e (0,  1).  Obviously,  T(A,  • ) is  a real  analytic  Fredholm  map, 
and  '/  '(/,  it)  has  index  0 for  every  it  e X x (T'  denotes  the  Frechet  derivative 
with  respect  to  it).  The  spectrum  of  T except  for  the  point  a consists  of 
isolated  eigenvalues  of  linite  multiplicities.  If  A > 0,  it  < v implies  T(A,  it)  <§ 
T(A,  v)  and  T'(A,  »)/C,  cz  Kx.  spr  T'(A,  it)  is  less  (resp.  greater)  than  1 if  and 


* By  replacing  sequences  by  subsequences  in  the  proofs,  one  could  weaken  this  assumption 
to  A "K0  c Kt  for  some  n e N. 
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only  if  spr  //I  lF'(u)  is  less  (resp.  greater)  than  1.  It  is  independent  of  a.  If 
0 is  the  only  spectral  point  of  kA  ~ 'F'(u),  then  for  both  operators  the  spectral 
radius  is  less  than  1.  Otherwise,  kA~'F'(u)  possesses,  according  to  [21, 
Theorem  6.1],  a positive  eigenvalue  1/A0  of  maximum  modulus.  A simple 
calculation  verifies  the  assertion.  

A solution  u of  (3.1')  is  called  weakly  stable  if  a (T'(A,  u))cfi1(0) 
holds;  u is  called  stable  if  a(T'(k,  u))  c Bf 0)  holds.  A nonweakly  stable 
solution  is  called  unstable. 

To  justify  this  definition  we  point  out  that  if  all  eigenvalues  of  A~1F'(u ) 
are  real,  u is  stable  if  the  spectrum  of  id  — XA~  iF'(u)  is  positive.  A physically 
relevant  criterion  of  stability  is  a(A  - AF(u))  <=  C + . It  is  well  known  for 
ordinary  differential  equations  that  this  criterion  guarantees  the  asymptotic 
stability  of  u with  respect  to  solutions  of  du/dt  + Au  — AF(u)  = 0.  Moreover, 
a(A  — AF'(u))  c C+  is  a necessary  condition  for  Lyapunov  stability.  General- 
izations to  parabolic  equations  are  given  in  [16]. 

In  general,  very  little  can  be  said  about  the  location  of  cr(A  — AF(ti))  if 
a(id  - /.A  lF'(u ))  is  known,  and  vice  versa.  If,  however,  the  spectrum  of 
A — AF(u)  is  real  and  if  HI  and  H2  are  satisfied,  it  can  be  readily  seen 
by  means  of  Theorem  6.3  in  [21],  that  the  numerical  and  physical  notions 
of  stability  coincide  for  small  (A  |. 


Lemma  3.1.  Assume  HI  to  hold,  let  F:  X i -*•  X0  be  real  analytic,  and 
u e X 2 be  a solution  of  (3.1').  If  T"(A,  u)  converges  to  u for  all  u in  an  open 
set  of  X u then  u is  weakly  stable. 

For  the  proof  observe  that  for  fixed  A,  G(v)  = T(A,  u + v)  — T(A,  u ) satisfies 
the  assumptions  of  Theorem  2.  l(iii);  T"(A,  u)  converges  to  u if  G"(r)  converges 
to  0,  and  vice  versa. 

A sequence  ( u„ ) in  X0  is  called  bounded  from  above  (below)  if  there  is 
a u e X0  with  un  < u (u„  > u)  for  all  neN.  We  call  K0  weakly  regular  if 
every  monotonically  increasing  (decreasing)  sequence  that  is  bounded  from 
above  (below)  converges  weakly  in  X0.  u0  e X , satisfying  u0  P T(A,  u0) 
[u0  <?  T(A,  u0)]  is  called  a strong  supersolution  (subsolution). 


Theorem  3.2.  Let  HI,  H2,  A >0,  be  satisfied,  and  let  K0  be  weakly 
regular.  If  u0  is  a positive,  strong  supersolution  then  T"(A,  u0)  converges  to 
a weakly  stable  solution  u of  (3.1)  with  u > 0. 
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Proof.  The  sequence  un+1  = T(A,  u„ ) consists  of  positive,  strong  super- 
solutions and  un  un+1  holds.  Hence,  F(un)  > 0 decreases  monotonically.  In 
view  of  the  weak  regularity  of  K0 , F(un ) converges  weakly  in  X0.  Since 
A x is  compact,  /.A~lF(un)  converges  strongly  in  X x.  Hence,  un+1  - <xun 
converges  strongly  in  X x,  m e (0,  1 ),  which  implies  the  convergence  of  u„  to 
a solution  u.  For  the  weak  stability  of  u,  observe  that  the  set  of  strong 
supersolutions  isopen.  Let  U(u0)  <=  X x be  a neighborhood  of  strong,  positive 
supersolutions  of  u0,  [u,  u0]  = {u  e Xx  | u ^ u < u0)  and  Q = U(u0 ) n [u,  u0]; 
then,  as  will  be  shown,  T"(A,  u)  converges  to  u for  every  ue  Q,  and  0 is 
nonempty;  the  assertion  follows  via  Lemma  3.1. 

For  t'o  e Q the  sequence  vn+1  = T(k,  tn)  converges  to  some  v and 
M < Vn  < un  holds  for  all  n e N.  Therefore,  ii  = v.  The  interior  of  [ti,  u0]  is  non- 
empty since  u j belongs  to  it.  For  small  positive  a,  ua  = (l-a)u0 
+ au,  e U{u0)  n [m,  u0].  Hence,  ft  is  nonempty.  Q.E.D. 

Corollary  3.3.  Let  HI,  H2,  / >0  be  satisfied,  and  let  K0  be  weakly 
regular.  If  u0  is  a strong  subsolution  then  the  sequence  Tn(/„  u0 ) is  either 
unbounded  or  converges  to  a weakly  stable  solution  u of  (3.1)  with  uQ  <$  u. 

The  sequence  T"(/,  u0)  never  converges  to  an  unstable  solution,  if  u0  is  a 
strong  super-  or  subsolution.  Naturally,  all  stable  solutions  are  local 
attractors. 

A solution  ii  of  (3.1)  is  called  minimal  if  there  is  no  solution  u with 
u < a.  if  F( o)  > 0,  hence  T(/„  0)  0 for  /.  > 0,  holds,  then  0 is  a strong 

subsolution  and  there  is  at  most  one  positive  minimal  solution.  In  addition, 
this  solution  is  weakly  stable.  If  F(u)  > 0 for  all  u e X v then  by  using  the 
implicit  function  theorem  one  concludes  that  a positive,  minimal  solution 
exists  for  /.  in  some  interval  (0,  A0)  for  which  1 is  not  an  eigenvalue  of 
T'(k,  ii(A)).  There  ii(A)  is  stable  and  T"(A,  0)  converges  to  it  from  below. 

If  in  addition  to  H2.  F is  strictly  convex,  i.e., 

F(liu  + (1  - P)v)  < pF(u ) + (1  - P)F(v)  (3.2) 

holds  for  u / v,  fie  (0,  1),  then  every  nonminimal  solution  ii  is  unstable, 
whenever  1 i a(T(A,  ii)).  Namely,  let  u <u  be  solutions  of  (3.1);  u0"  = fiu 
+ (1  — P)u  is  a strong  supersolution  for  // g (0,  1). 

The  sequence  7"(A.  u0")  converges  from  above  to  uj1,  ii  < uj1  < ii.  Thus, 
in  every  neighborhood  of  ii  there  are  u0  such  that  7’"(A,  u0)  does  not 
converge  to  ti.  Hence,  it  is  not  stable.  Since  7”(A,  ii):  Kx  -*  Kx,  there  is  a 
positive  eigenvalue  of  maximal  modulus  [21.  Theorem  6.1]  that  has  to  be 
greater  than  1. 
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If  F is  strictly  concave  [(3.2)  holds  in  reversed  order],  then  every  minimal 
stable  solution  it  is  maximal  even  in  the  class  of  subsolutions.  Since  u < u, 
and  u a subsolution  of  (3. 1 ')  implies  that  /lu  + ( 1 - /i)u  is  a strong  subsolution 
for  every  /i.  The  argument  proceeds  as  above. 

Corollary  3.4.  Let  HI,  H2  be  satisfied,  and  let  K0  be  weakly  regular. 

(i)  IfF(O)  > 0,  there  exists  at  most  one  positive,  minimal  solution  and 
it  is  weakly  stable. 

(ii)  If  F is  strictly  convex  in  Xq,  then  every  nonminimal  solution  u is 
unstable  if  1 is  not  an  eigenvalue  of  T'(A , u). 

(iii)  If  F is  strictly  concave  in  X u then  every  stable,  minimal  solution 
is  maximal,  even  among  the  class  of  subsolutions. 

As  an  application  we  consider  a semilinear  Dirichlet  problem  describing, 
e.g.,  the  temperature  distribution  in  a chemically  active  gas  [8]  (for  other 
applications  see  [ 1 2]).  D cz  IK"  denotes  a bounded  domain  with  a C2-boundary 
and  A the  n-dimensional  Laplacian.  Choose  p>  n,  X0  = Lp(D ),  X,  = 
C'(D)  = {ue  C‘(D)|u(x)  = 0 for  x e dD\, 

X 2 = Wp2(D)  = {u  e Wp2(D)\u(x)  = 0 for  x e vD\. 

Wp  can  be  considered  a compact  subset  of  C'(D).  A:  u - Am  is  a 
topological  isomorphism  between  \Vp2  and  Lp.  Hence,  A 1 : Lp  -*  (4(D)  is 
compact.  A 1 has  a representation  by  the  Green  function  G that  satisfies 
G > 0 in  D x D and  dG/dv  > 0 in  dD  x D (cf.  [26.  p.  85]);  v denotes  the 
interior  unit  normal.  Let  K0  = {u  e Lp(D)\u(x)  > 0 a.e.  in  D),  Kt  = 
\ueC'(D)  |u(x)  > 0,  x e D).  Then  A 1 is  strongly  positive,  since  for  every 
u e K()  we  have  (A  ‘»)(x)  > 0 for  x e D and  (d(A~  lu)/dv)(x)  > 0 for  x e dD. 
K()  is  weakly  regular  in  view  of  [10,  Theorem  13.44], 

If  F satisfies  H2  then  all  conclusions  of  this  section  are  true;  e.g., 
F(u)  = eu  implies  the  existence  of  a unique,  positive,  stable  minimal  solution 
ii(x)  in  some  interval  (0,  /0)  [3].  For  u0  sufficiently  close  to  0,  7’"(A,  u0)  con- 
verges to  u(/).  Every  nonminimal  solution  is  unstable  if  1 is  not  an  eigen- 
value of  7'(x,  u).  There  exists  no  strong  supersolution  U with  U > u. 


4.  Bifurcation 

In  this  section  we  describe  selection  properties  for  some  specific  bifurcation 
phenomena,  namely,  when  F,  in  addition  to  H2,  satisfies  F(0)  = 0 and  is 
either  concave  or  convex,  u = 0 is  always  a solution  of  (3.1')  (the  trivial 
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solution ).  Under  suitable  hypothesis  there  exists  a positive,  simple  character- 
istic value  A0  of  minimal  modulus  of  A lF'( 0).  The  nontrivial  solutions 
of  (3.1)  emanating  in  A0  form  an  analytic  branch  that  is  stable  for  A > A0 
and  unstable  for  / < A0  (cf.  [5]).  Here  we  show  that  the  sequence  Tn(X,  w0), 
for  certain  uQ , always  converges  to  the  stable  solution.  Moreover,  one 
easily  obtains  existence  and  uniqueness  results. 

For  strictly  concave  F and  u > 0,  a e (0,  1),  one  has  F(nu)  > a F(u)  > 0. 
Hence,  F'(0):  K()  and  T'(A,  0):  Kx  -*  Rx  for  A > 0.  Theorem  6.3  in 

[22]  guarantees  that  there  is  a positive,  simple  characteristic  value  A0  of 
minimal  modulus  of  A lF'( 0).  The  corresponding  eigenfunction  satisfies 
<p  0.  Using  [5]  one  obtains  u = 0,  A = A0  as  a point  of  bifurcation;  the 
nontrivial  branch  forms  locally  a real  analytic  curve. 

Theorem  4.1.  Let  HI,  H2  be  satisfied,  and  let  K0  be  weakly  regular. 
In  addition,  assume  F(0)  = 0,  F strictly  concave.  Then,  A 'F'( 0)  possesses 
a positive,  simple  characteristic  value  A0  of  minimal  modulus.  Equation  (3. 1) 
has  no  positive  solution  in  [0,  A0].  Nontrivial  positive  solutions  exist  in  a 
right-neighborhood  of  A0  ; they  form  a real  analytic  branch  u(A)  with 

u(/r0)  = 0. 

The  trivial  solution  is  stable  in  (0,  A0)  and  unstable  for  A > A0,  where 
«(/)  is  stable.  The  sequence  7"(A,  un)  converges  to  0 if  u0  > 0 is  a strong 
supersolution  and  if  A e [0,  /„);  in  some  interval  (A0,  /,)  the  sequence 
converges  to  u(/). 

Proof.  The  bifurcation  picture  near  /0  follows  directly  from  [21  ] and  [5]. 
For  global  existence  of  the  positive  branch  see  [6].  u = 0 is  a stable  solution 
of  (3.1)  in  the  interval  [0,  /0).  Assume  that  U > 0 is  an  additional  solution 
for  some  A e (0,  A0).  In  view  of  the  concavity  of  F,  one  obtains  7 (A,  /fU) 
//T(A,  U)  = liU  ^>0  for  /i  e ( 0,  1).  Hence,  fill  is  a positive,  strong  sub- 
solution, and  7 "(A,  / iU ) defines  a monotonically  increasing  sequence  of 
strong  subsolutions.  This  yields  a contradiction  to  the  stability  of  0 since 
/ill  can  be  chosen  arbitrarily  close  to  0. 

The  same  argument  shows  that  there  is  no  positive,  strong  subsolution 
in  [0,  A0)-  Now,  assume  U > 0 to  be  a solution  of  (3.T)  for  A = A0.  Then 
T(A0,  liU)  £>  /iU  holds  for  /ie  (0,  1).  For  fixed  /f,  /iU  is  a positive,  strong 
subsolution  for  A < A0,  yielding  a contradiction. 

u = 0 is  unstable  for  A>A0,  since  A/A0  is  the  eigenvalue  of  maximal 
modulus  for  A 1 F'( 0).  According  to  [5],  ti(A)  is  stable  for  sufficiently  small 
A — A0 . Moreover,  m(A)  is  a minimal,  positive  solution.  In  view  of  its  stability 
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it  is  maximal  as  well  among  all  positive  solutions  (same  argument  as  above). 
The  sequence  T"(A,  u0)  converges  to  u(A)  from  above  if  u0  is  a positive, 
strong  supersolution  and  if  A is  sufficiently  close  to  A0 . Q.E.D. 

For  positive,  strong  subsolutions  u0,  which  are  small  in  norm,  A > A0, 
T"(A,  u0)  converges  to  ii(A)  from  below.  Without  the  restriction  for  the 
norm  the  sequence  may  be  unbounded. 


Theorem  4.2.  Let  HI,  H2  be  satisfied,  let  K0  be  weakly  regular,  and 
let  F be  strictly  convex,  F(0)  = 0,  F'(0):  /C,  -♦  K0.  Then  A ~lF'(0)  possesses 
a simple,  positive  characteristic  value  A0  of  minimal  modulus.  For  A > A0 
Eq.  (3.1)  has  the  unique  solution  u — 0 in  u = 0 is  stable  for  A e (0,  A0) 
and  unstable  for  A > A0 . 

The  positive  solutions  branching  off  at  A = A0  exist  for  A in  a left- 
neighborhood  of  A0.  They  form  a real  analytic  curve  m(A).  In  the  class  of 
positive  solutions  u(A)  is  minimal  and  maximal;  it  is  unstable. 

There  are  no  positive,  strong  supersolutions  for  A > A0 . The  sequence 
T"(A,  u0)  is  unbounded  if  u0  is  a positive,  strong  subsolution  and  A > A0. 
For  A < A0  and  A0  — A sufficiently  small,  there  are  no  positive,  strong  sub- 
solutions that  are  small  in  norm;  the  sequence  T"(A,  u0)  converges  to  0 if 
u0  is  a positive,  strong  supersolution. 

Proof.  The  proof  of  the  spectral  properties  and  the  bifurcation  behavior 
proceeds  as  in  Theorem  4.1.  If  A > A0  is  different  from  all  characteristic 
values  of  A~  *F(0)  and  if  U > 0 is  a solution  of  (3.T)  then,  in  view  of  the 
convexity  of  F , [iV  is  a positive,  strong  supersolution  for  /i  e (0,  1).  The 
sequence  T"(A,  /1U)  converges  to  a solution  l/t  with  (UJ  $>  (J r $>  0 (0  is 
unstable).  Since  u = 0 is  isolated  one  obtains  a contradiction.  The  same 
argument  excludes  the  existence  of  strong,  positive  supersolutions. 

If  A coincides  with  a characteristic  value  A;  of  /F'F(O).  then  [iU  is  a 
positive,  strong  supersolution  for  A > Aj . Hence,  0 is  the  unique  solution  of 
(3.1)  for  A > A0.  The  stability  properties  of  the  trivial  solution  are  obvious. 

The  branching  solution  ii(A)  is  unstable  [5]  and  isolated  for  A < A0 , A close 
to  A0.  Among  the  positive  solutions  it  is  minimal.  If  U > »(A)  is  another 
solution  of  (3.1)  then  u0  - fiU  + ( 1 - [i)u( A)  is  a positive,  strong  super- 
solution and  T”(A,  u0)  converges  to  a solution  v of  (3.1)  with  m(A)  < r u0  . 
Since  fi  can  be  chosen  small,  one  obtains  a contradiction  to  the  instability 
and  isolation  of  u(A).  Hence,  u(A)  is  maximal. 
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If  U is  a positive,  strong  supersolution,  / iU , [i  > 0,  is  also  one.  The 
sequence  7"(A,  u0),  A > A0,  converges  from  above  to  some  v,  fill  £>  v £>  0. 
This  yields  a contradiction  to  the  instability  and  isolation  of  u = 0 for 
A t £ /.j . For  A = kj  the  argument  proceeds  as  above.  The  unboundedness  of 
7n(A,  n0),  A > A0,  follows  from  the  uniqueness  of  the  trivial  solution,  if  u0  is 
a positive,  strong  subsolution. 

Choose  A < A0  so  that  u(A)  is  the  minimal,  positive  solution  for  A e (/,  A0). 
Since  0 u0  <3  u(k)  holds  for  positive,  strong  subsolutions  u0  that  are  small 
in  norm,  7n(A,  u0)  converges  to  u(A)  from  below  for  A e (A,  A0),  a contradiction 
to  the  instability  of  u(A).  The  convergence  of  T"(A,  u0)  to  0 follows,  for 
positive,  strong  supersolutions  u0,  from  the  maximality  and  the  instability 
of  u(A).  Q.E.D. 

We  close  this  section  with  some  remarks  on  local  selection  properties  of 
theTteration  scheme  near  the  bifurcation  points  generated  by  a simple  real, 
or  a pair  of  simpte^purely  imaginary  eigenvalues  of  the  linearization  (for 
proofs,  see  [30]).  Let  X be  a real  Banach  space,  V a neighborhood  of  0 
in  IR,  and  7:  V x X -*  X a Cx-mapping.  We  denote  by  D;  (resp.  Du ) the 
partial  derivatives  with  respect  to  A (resp.  u).  Moreover,  T(A,  0)  = 0 should 
hold  for  all  A e IR.  We  consider  the  equation 

u = T(k,  u)  (4.1) 

Let  1 be  a simple  eigenvalue  of  Du  7(0,  0),  </>  the  normed  eigenfunction,  Q 
the  corresponding  Dunford  projector,  and  P = id  — Q.  Furthermore,  the 
following  notations  are  used: 

Y = PX , K'  = Y+  IR>,  K~  = Y+R-(p 

Assume  that  DU)T( 0,  0 )(peK\  DJT(k,  0)  = 0,  j = 2 n,  and 

(a)  Dnu  1 1 7(0,  0 )(pe  K , (b)  D"u  + 1 7(0,  0)q>  e K + holds  for  sufficiently  small 
| A | . Then  u = 0,  A = 0 is  a point  of  bifurcation.  If  n is  odd,  there  exists 
exactly  one  nontrivial  branching  solution  of  (4.1)  for  every  sufficiently  small 
| A | . If  n is  even,  then  in  case  (a),  A > 0,  and  case  (b),  A < 0,  there  exists 
exactly  one  nontrivial  branching  solution  u + (A)  e K + [resp.  u~(A)  e K~\ 
Analogous  results  with  a slightly  different  notion  of  simplicity  are  proved 
in  [5]  (cf.  [24]  also).  The  proof  of  the  present  assertion  uses  the  center- 
manifold  theorem  of  Ruelle  and  Takens  [27], 

For  the  selection  property  of  the  iteration  scheme  one  needs  in  addition 
that  7 is  a real  analytic  Fredholm  operator  with  index  0 and 
dim  X/Du  7(A,  0)"A'  < n0(^)  < for  all  neN.  The  trivial  solution  u = 0 is 
stable  for  A < 0;  the  sequence  T"(A,  u0)  converges  to  0 for  small  j|u0||.  If 
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A > 0,  the  trivial  solution  is  unstable  and  the  set  N = {x  e X/T"( A,  x)  -»()j 
is  according  to  Theorem  2.1  of  first  category.  If  a nontrivial  branching 
solution  u ' (A)  e K f [resp.  u (A)  e K ] exists,  then  for  small  u0|j,  / "(A,  u0) 
converges  in  a set  of  second  category  to  k'(A)  [resp.  u (A)]  depending  on 
whether  u0  e K ' or  u0  g K holds.  Hence,  these  solutions  are  weakly  stable. 

If  at  A = 0 a pair  of  conjugate  complex,  simple,  nonreal  eigenvalues  of 
Du  7 (A,  0)  leaves  the  closed  unit  circle  with  nonvanishing  “velocity,”  and  if 
all  other  spectral  points  remain  for  small  |A|  in  the  open  unit  circle,  then 
according  to  [27]  there  exists  a two-dimensional  attractive,  invariant 
manifold  M;  of7  (A,  •)  in  a neighborhood  F(0)of0.  Under  certain  additional 
assumptions  M x contains  a closed  attractive  curve  Sx , which  contains  0 in 
its  interior.  Then  un  = 7’"(A,  u0)  converges  to  Sx  for  almost  all  tt0  of  a 
neighborhood  of  0,  i.e.,  the  distance  d(un,  ,S'J  converges  to  0.  For  A < 0 
the  sequence  converges  to  the  stable  solution  u = 0. 
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Introduction 

One  of  the  purposes  of  this  symposium  is  to  discuss  the  effect  of  techniques 
of  ordinary  differential  equations  and  the  point  of  view  of  dynamical  systems 
on  problems  in  partial  differential  equations.  This  is  a broad  topic  and  the 
author  is  competent  to  discuss  only  small  parts  of  it.  Perhaps  the  most 
obvious  observation  about  the  theory  of  ordinary  differential  equations 
vis-a-vis  partial  differential  equations  concerns  a striking  difference.  There 
are  a few  basic  theorems  about  ordinary  differential  equations  concerning 
the  existence,  uniqueness,  and  dependence  on  given  data  of  solutions  of 
initial-value  problems.  Among  other  things  these  theorems  guarantee  a rich 
supply  of  solutions  for  most  ordinary  differential  equations,  and  provide 
convenient  parametrizations  of  the  solutions  of  a given  equation  and  basic 
information  used  throughout  the  development  of  more  special  topics  and 
results.  The  theory  of  partial  differential  equations,  on  the  other  hand,  does 
not  have  a few  unifying  basic  results.  Instead,  it  splits  immediately  into 
many  topics,  each  with  its  own  fundamental  theorems  and  methods.  Indeed, 
even  the  problem  of  identifying  those  linear  partial  differential  equations 
that  admit  any  solutions  at  all  is  quite  deep  and  solved  only  in  special  cases. 

However,  most  problems  in  partial  differential  equations  arising  from 
physical  models  either  have  the  form  of  evolution  equations,  which  describe 
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the  change  of  a physical  system  in  time,  or  result  from  seeking  stationary 
solutions  of  some  evolution  problem.  These  evolution  problems  can  often 
be  regarded  as  ordinary  differential  equations  in  some  infinite-dimensional 
space. 

The  principal  goal  of  this  chapter  is  to  describe  and  hopefully  to  render 
more  accessible  some  recent  results  concerning  abstract  Cauchy  problems 
in  infinite-dimensional  spaces  that  are  close  in  spirit  to  classical  ordinary 
differential  equations.  Moreover,  they  cover  a broad  range  of  interesting 
problems  in  partial  differential  equations.  In  accordance  with  this  goal,  it 
is  assumed  that  the  reader  is  familiar  with  basic  functional  analysis  and 
ordinary  differential  equations,  but  not  partial  differential  equations,  Sobolev 
spaces,  or  the  Hille-Yosida  theorem. 

Section  1 is  completely  informal  and  is  intended  mainly  to  make  the 
material  in  Section  2 seem  more  natural  than  it  might  otherwise  appear. 
Section  2 describes  basic  abstract  existence  and  uniqueness  results  for  the 
Cauchy  problem  in  our  setting.  These  results  are  discussed  for  the  case  of 
general  Banach  spaces,  which  adds  greatly  to  their  complexity.  However,  it 
is  this  case  we  wish  to  emphasize.  (The  theory  in,  e.g.,  Hilbert  spaces  is 
cleaner  and  much  more  highly  developed.  If  the  reader  prefers  to  begin 
with  this,  we  recommend  the  book  of  Brezis  [11].)  Section  3 is  intended  to 
provide  a hint  of  the  scope  and  flavor  of  applications  of  the  abstract 
existence  theory  to  concrete  problems  in  partial  differential  equations. 
Section  4 discusses  briefly  the  question  of  continuous  dependence  on  the 
equation  of  the  solutions  described  in  Section  2 and  other  perturbation 
questions.  Section  5 indicates  another  setting  for  ordinary  differential  equa- 
tions in  infinite  dimensions,  which  corresponds  to  problems  in  partial 
differential  equations  and  the  relationship  of  this  setting  to  that  of  Section  2. 
Section  6 contains  most  of  the  references  and  various  further  remarks.  It 
may  be  consulted  while  reading  Sections  1-5  and  is  intended  to  help  make 
the  literature  more  accessible. 

There  are  two  appendixes.  In  Appendix  1 an  estimate  is  established  that 
contains  a crucial  part  of  the  proof  of  the  existence  results  described  in 
Section  2.  It  may  be  used  in  the  problems  introduced  in  Section  5 as  well. 
Kaplan  and  Yorke  [30]  announce  results  that  apply  to  problems  of  the 
forms  described  in  Sections  2 and  5 simultaneously,  and  the  result  in 
Appendix  1 is  a clarification  and  sharpening  of  a result  of  theirs. 

Appendix  2 contains  an  important  estimate  due  to  Benilan  [3],  It  is 
included  for  the  reader’s  convenience. 
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1.  Orientation 

Let  us  begin  by  sketching  one  way  in  which  a problem  in  partial 
differential  equations  may  be  transformed  so  that  it  has  the  appearance  of 
an  initial-value  problem  for  an  ordinary  differential  equation.  For  purposes 
of  illustration,  we  choose  the  nonlinear  parabolic  equation 

*,U  (f,  x)  - Au(f,  x)  + g(u(t,  x))  = 0,  / > 0,  x e ft,  (NPE) 

ct 

where  A = ]T,%= , ( d/dx ,)2,  ft  is  an  open  subset  of  R*  with  boundary  eft, 
and  g:  !R  -* IK.  In  addition  to  (NPE)  we  impose  the  boundary  condition 

u(t,  x)  = 0 for  t > 0,  x € eft,  (BC) 

and  the  initial  condition 

u(0,  x)  = u0(x)  for  x € ft,  (IC) 

on  the  unknown  function  u.  The  first  step  is  to  choose  a Banach  space  X 
of  real-valued  functions  on  ft  [e.g.,  Lp(ft)]  and  interpret  the  unknown  function 
u(t,  x)  as  the  map  t -*  u(t,  ■ ),  which  is  to  have  values  in  X.  The  function 
t -*  u(t , •)  is  denoted  simply  by  u(t).  Then  (du/8t)(t,  x)  can  be  thought  of  as 
(du/dt)(t),  the  derivative  of  the  X-valued  function  u(t).  Next  we  define  an 
operator  A : D(A)  c X -*  X by  Au  = — Au  + g(u ) for  u e D(A),  where  D(A) 
must  be  chosen  so  that  the  expression  Au  = —Au  + g(u)  has  (some  sort  of) 
a meaning  and  lies  in  X for  u 6 D(A).  Moreover,  the  boundary  condition 
(BC)  should  be  built  into  D{A)  by  requiring  (in  some  sense)  that  u e D(A) 
implies  u = 0 on  c’ft.  Then  (NPE),  (BC),  (IC)  can  formally  be  written  as 
the  abstract  Cauchy  problem 

(DE)  (du/dt)  + Au  = 0,  (IC)  m(0)  = u0 . (ACP) 

Several  comments  are  in  order  at  this  point.  To  go  from  (NPE),  (BC),  (IC) 
to  (ACP)  we  introduced  only  one  space  X and  an  operator  A : D{A ) ^ x -*  X. 
There  are  other  ways  to  convert  problems  in  partial  differential  equations 
to  problems  that  look  like  (ACP)  but  that  involve  more  spaces.  In  particular, 
u and  du/dt  may  take  values  in  different  spaces.  (Sec  the  chapter  by  Tartar 
in  this  volume.)  We  will  mostly  be  concerned  with  a “one-space”  framework, 
i.e..  A:  D(A )e  X -*  X,  which  lies  closest  to  classical  ordinary  differential 
equations.  Let  us  say  the  reduction  of  a problem  to  the  form  (ACP)  is 


134 


MICHAEL  G.  CRANDALL 


“successful”  if  some  abstract  results  can  then  be  applied  to  obtain  informa- 
tion about  the  original  problem.  In  general,  successful  reductions  are  not 
unique  even  if  they  exist  and  before  one  can  make  any  successful  reduction, 
he  must  know  a great  deal  about  his  original  problem.  For  simplicity, 
let  Q = R'v  in  our  example.  Then  the  choices  X = BU(UN)  (bounded 
uniformly  continuous  functions  on  R*)  or  W(UN),  1 < p < oo,  and  D(A)  = 
{u  e X : A u,  g(u)  e X)  with  Au  = — Au  + g(u)  are  successful  for  (NPE),  (IC) 
provided,  e.g.,  that  g is  diffeuntiable  and  g'  is  bounded  below.  (Indeed,  in 
these  cases  the  theory  we  describe  in  Section  2 applies.)  Here  Au  is  under- 
stood in  the  sense  of  distributions.  In  this  example,  we  see  characteristic 
features  of  (ACP)  corresponding  to  its  origin  in  a problem  in  partial 
differential  equations:  X is  infinite  dimensional,  D(A)  is  small  in  the  sense 
of  category,  and  A is  very  discontinuous.  These  features  preclude  direct 
use  of  some  of  the  standard  methods  from  ordinary  differential  equations 
(Picard  iterations,  polygonal  approximations)  for  studying  the  existence  of 
solutions  of  (ACP),  since  these  methods  involve  successive  applications  of  A. 

Thus  consider  instead  the  implicit  (backwards)  difference  scheme 


«e( 0 ~ ~ £) 

e 


+ A ue(t)  = 0 
u£(f)  = u0 


for  t > 0, 
for  t < 0, 


(BD) 


where  e > 0.  Not  worrying  about  justifications  at  this  point,  we  formally 
solve  (BD)  as  follows:  One  has  u£(f)  + eAue(t)  = ue(t  — e),  or  ue(t)  = 
(/  + eA ) - ' ue(t  - e).  Let  Je  = (I  + eA)~  1 and  iterate  to  find 

ut(t)  = 4'/e,+  1”o  ^ t>  0,  (1.1) 


where  [r]  is  the  largest  integer  in  ( — oo,  r].  This  looks  promising,  since  even 
if  A is  a differential  operator,  Jt  = (/  + eA)~  1 may  be  well  behaved.  (BD)  is 
called  an  “implicit  scheme”  because  it  leads  to  formulas  involving  the 
inverses  (/  + eA  )"  1 = Jt . Ignoring,  for  the  moment,  our  previous  motivation, 
let  us  consider  properties  of  A that  might  be  favorable  to  the  existence  of 
the  limit  limc  l0ut.  For  one  thing,  it  will  be  necessary  to  have  u£  defined, 
which  requires  the  iterates  Jek  of  Jf  to  have  a large  enough  domain.  Hence 
D(Jt ) = R(1  + eA)  should  be  large  enough.  Moreover,  it  seems  reasonable 
to  ask  that  the  iterates  Jk  be  equicontinuous.  The  simplest  way  to  guarantee 
this  is  to  ask  that  J,  = (/  + eA)~  1 be  a contraction  [i.e.,  ||Jtx  — J£y||  < 

||  x — yj|  for  x,  ye  D{Jr )]  for  e > 0.  It  is  a happy  fact  that  under  these  rather 
minimal  assumptions  lim£i0  ujt)  will  exist  for  suitable  u0.  Moreover,  the 
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assumptions  are  satisfied  in  a diverse  collection  of  problems  from  partial 
differential  equations. 

In  the  next  section  we  will  state  more  precisely  the  result  hinted  at  above. 
The  point  here  is  that  the  implicit  scheme  (DE)  leads  naturally  to  the 
assumptions  made  and  provides,  in  fact,  a mnemonic  device.  One  further 
thing  we  would  like  to  motivate  by  these  considerations  is  the  following: 
If  /is  a function,/-1  need  not  be.  Similarly,  if  (/  + i:A)~  1 is  a function, 
A need  not  be.  Thus  A will  be  allowed  to  be  “multivalued,”  a generality 
very  useful  in  applications. 

Finally,  let  us  remark  that  attempting  to  solve  (ACP)  via  the  backwards 
difference  scheme  (BD)  is  but  one  of  many  possible  approaches.  This  is 
clear  from  the  chapter  by  Tartar.  However,  in  our  context,  (BD)  appears 
a natural  way  to  introduce  a theory  rich  in  structure  and  with  many 
applications,  and  not  simply  one  of  a multiplicity  of  possibilities. 


2.  Accretive  Operators,  the  Cauchy  Problem,  and  Semigroups 

For  convenience,  we  will  call  a mapping  A : X -*  2*  (the  set  of  subsets  of 
A")  an  operator  in  X.  Let  D(A)  = {x  e X : Ax  ^ 0).  An  operator  A is  single 
valued  if  Ax  is  a singleton  for  x e D(A),  and  in  this  case  we  use  Ax  to  denote 
both  the  singleton  and  its  element.  In  this  way  functions  A:  D(A)  c X -*■  X 
are  identified  with  single-valued  operators.  Given  operators  A,  B,  and  a s R, 
A + B,  a A,  and  A~  1 are  defined  in  the  obvious  ways: 

(A  + B)(x)  = Ax  4-  Bx,  A~  1 y = {x  € X : y € Ax}, 

(aA)(x)  = a(Ax)  for  x,  yeX.  s 

One  also  sets  R(A)  = (J  {Ax  : x e D(A)}.  The  identity  map  of  X is  denoted 
by  /,  the  norm  by  ||  ||. 

Definition.  Let  A be  an  operator  in  X.  Then  A is  accretive  if  (/  + r,A)~  1 
is  a contraction  for  r.  > 0. 

Let  us  unravel  the  terminology:  An  operator  J in  X is  a contraction  if 
x,  e Jz,,  i = 1,  2,  implies  ||xj  — x2||  < || z,  — z2||.  (Observe,  taking  z,  = z2, 
that  contractions  are  single  valued.)  Now  x,  e (I  + t:A)~ 'z,  means 
z,  € (/  + t:A)x,  or  z , = x,  + cyt  for  some  y,  e Ax, . Thus  (I  + r,A)~ 1 is  a con- 
traction if 


|(x,  + cy,)  - (x2  -l-  ey2)||  > ||x,  - x2 


for  y,  e Ax,,  i = 1,  2.  (2.1 ) 
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Therefore,  A is  accretive  iff  (2.1)  holds  for  t > 0.  There  is  yet  a third  way 
to  say  that  A is  accretive.  Let  [ , ]A:  X x X -*  R be  defined  for  A # 0 by 

[x,y]A  = (||x  + Ay||- |(x|()/A. 

Since  A -►  ||x  + Ay||  is  convex  in  A,  [x,  y]x  is  nondecreasing  in  A,  and  we  may 
define  [ , ]±'-  X x X ->  R by 

[x,  y]  + = lim  [x,  y]A  = inf[x,  y]x, 

/io  x>o  (2.2) 

[x,  y]_  = lim  [x,  y]x  = sup  [x,  y]x . 

GO  /<0 

By  (2.2),  || x + Ay ||  > ||x||  for  A > 0 iff  [x,  y]+  > 0.  Thus  A is  accretive  iff 
[x,  - x2,  y,  - y2]  + > 0 for  6 Axt-  (2-3) 

The  result  alluded  to  in  the  previous  section  is  stated  next. 


Theorem  1.  Let  A bean  accretive  operator  in  X and  R(l  + kA)  3 D(A ) 
for  A > 0.  If  Jt  = (/  + v.A)~  \ T > 0,  and  x e D(A),  then 

S(f)x  = lim  J*,/£l+,x  (2.4) 

GO 


exists  uniformly  for  0 < t < T.  Moreover,  the  S(f)  so  defined  is  a semigroup 
of  contractions  on  D(A).  That  is, 


(i)  S(t):  D(A)  -*  D(A) 

(ii)  S(/)S(r)  = S(t  + t) 

(iii)  lim  S(f)x  = S(0)x  = x 

(iv)  ||S(t)x  - S(f)y||  < 1 x - y|| 


for  t > 0; 
for  f,  t > 0; 
for  xeD(A); 

for  x,yeD(A),  t > 0. 


Theorem  1 is  proved  in  [22].  The  proof  is  completely  elementary.  A 
generalization  of  the  basic  estimate  of  the  proof  is  given  in  Appendix  1. 
From  the  discussion  in  the  preceding  section,  we  expect  S(t)x  = u(t)  to  be 
a solution  of  du/dt  + Au  = 0,  w(0)  = x,  in  some  sense,  but  the  above 
theorem  contains  no  assertion  to  that  effect.  Indeed,  when  dealing  with 
partial  differential  equations,  the  precise  notions  of  solution  that  are  suitable 
to  the  problem  at  hand  can  be  rather  subtle,  and  one  must  always  pay 
attention  to  this  point.  In  any  result  for  (ACP)  general  enough  to  cover 
examples  from  partial  differential  equations,  one  can  expect  to  worry  a little 
about  the  appropriate  notion  of  solution,  a problem  not  present  in  most 
of  classical  ordinary  differential  equations.  It  is  worthwhile  being  more 
general  while  discussing  this  point. 
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Let  A be  a subinterval  of  R.  Lloc(:A,  X)  denotes  the  space  of  functions 
/:  .A  -»  X that  are  (strongly)  integrable  on  compact  subsets  of .A  equipped 
with  the  topology  of  L1  convergence  on  compact  subsets  of  .A.  (The  reader 
unfamiliar  with  integration  of  X -valued  functions  can  simply  pretend  X = R* 
when  an  integral  appears.)  Below/  g e L'loc(:A,  A")and  A is  an  operator  in  X. 
Consider  the  evolution  equation: 

(du/dt)  + Ausf  (E  )f 

We  want  to  say  when  a function  u is  a solution  of  (E)f.  Three  notions  (at 
least ) are  appropriate.  First,  u is  a strong  solution  of  (E),  on  A if  u € C(.A,  X ) n 
l^.l(A,  X)  and  u'(t)  + Au(t ) a J (t)  a.e.  t e /A.  Here  C(:A,  X)  is  the  space  of 
continuous  maps  from  A into  X with  the  topology  of  uniform  convergence 
on  compact  subsets  of  .A  and  ue  X)  means  that  there  is  a 

ve  Ll'oc(.y,  X)  such  that 

f 

u(t)  — u(s)  = ) v(r)  dr  for  t , s e .A 

* J 

[in  which  case  u‘(t)  = (du/dt)(t)  - v(t)  a.e.  t e '/].  The  notion  of  a strong 
solution  is  such  that  one  can  use  most  of  the  standard  arguments  of  calculus 
in  studying  solutions  of  (E)y.  The  most  obvious  way  to  weaken  this  notion 
is  to  "take  limits”  of  strong  solutions  in  a manner  so  that  conclusions 
reached  by  doing  calculus  may  be  preserved.  Thus  u is  called  a weak  solution 
of  (E)y  on  .'/•  if  there  is  a sequence 

{("nJnYl^^Cl  'A  X)  X LU'AX) 

such  that  un  is  a strong  solution  of  (E)/non  'A  and  (u„,  /„)  ->  («,/)  in  C(  A,  X) 
* LlA-A,  X).  Finally,  u is  called  an  integral  solution  of  (E)^  on  A provided 
u e C(.A,  X)  and 

NO  - *11  - NO  - *1  < | HO  - *,/(*)  - y]+  da 

Js  (2.5) 

for  t > s,  l,  s «=  .A,  x e D{A ),  y e Ax. 

The  integral  above  is  well  defined  by  virtue  of  the  facts  that  [ , ]+  is  upper- 
semicontinuous  [see  (2.2)]  and  |[x,  y]+|  < ||y||  (from  the  definition).  The 
notion  of  integral  solution  is  appropriate  only  if  A is  accretive  and  it  arises 
from  the  following  considerations.  First,  using  the  definitions  of  [ , ]±  one 
easily  shows  that  if  u:  A -*  X then 

- [“(<).  • 


= [»(<).  (2.6) 
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where  D(,  Dr  are  left  and  right  differentiation,  respectively.  Equations  (2.6) 
are  to  be  understood  in  the  sense  that  if  (£>,  u){t)  exists,  then  so  does  D;||u(f)|| 
and  the  equation  holds,  etc.  It  follows  that  if  u and  ,'u||  are  differentiable  at 
/,  then 

^ NOII  = “'(')]  + = [“(O’  “'(0J-  • (2-7) 

In  (2.7)  the  differentiability  of  u does  not  imply  that  of  j|u||  [e.g.,  u(t)  = t , 
X = R,  so  ( u(r) | = |t|].  Now  if  u is  a strong  solution  of  u + Auafand  v 
is  a strong  solution  of  v'  + Av  3 y on  then  || u(r ) — c(r ) ||  is  locally 
absolutely  continuous,  and  we  have 

d ||u(0  - 001  = [“(0  “ 00’  “'(0  - ”'(')]- 
dt 

= [Of)  - 00’  (“'(f)  -/(f))  - W(t)  - Of))  + (/(f)  - 00)]- 
< [Of)  - Of)’  -((0(0  - f00)  - (/( 0 - “ (0))]- 
+ [00  - 00’/(0  - 0(0]+ 

<[0f)-00’/(f)-00]+ 

a.e.  on  J/.  In  the  above  calculation  we  used  the  results  [x,  y + z]_  < 
[x,y]_  + [x,z]+  (this  is  easy  to  establish),  [x,  -y]_  = -[x,  y]  + , g(t)  - v'(t)e 
Av(t)  a.e., /(f)  - u'(f)e  Au(t)  a.e.,  and  A satisfies  (2.3)  since  it  is  accretive. 
The  above  inequality  shows  that 

IIOO  - 0011  - NO  - 001  ^ I [“(*)  - 0«)’/(a)  - 0(a)]+  d*  (2-8) 

J S 

if  u is  a strong  solution  of  (E)7  on  v is  a strong  solution  of  (E)g  on 
./,  and  s < f,  f,  .s  e ./.  If  y e Ax,  then  v = x is  a solution  of  v'  + Av  3 y.  Setting 
v = x,  g = y in  (2.8)  yields  (2.5).  Thus  a strong  solution  of  (E)f  is  an  integral 
solution  of  (E)f.  Moreover,  since  [ , ]+  is  upper-semicontinuous,  a weak 
solution  of  (E)7  is  an  integral  solution. 

Let  us  return  to  (E)0  and  the  functions  S(f)x  of  Theorem  1.  To  indicate 
the  dependence  of  S on  A,  we  write  S(t)  = S4(f)  if  S is  obtained  from  A via 
(2.4).  A is  said  to  be  closed  if  its  graph  G(A)  = {(x,  y) : x g X,  ye  Ax}  is 
closed.  The  closure  of  A,  A,  is  defined  by  G(A)  = G(A\  A is  accretive  if  A 
is  accretive.  Some  basic  facts  follow.  If  A is  closed,  then  S,<(f)x  is  a strong 
solution  of  (E)0  on  t > 0 for  x e D(A ) iff  SA(t)x  is  differentiable  a.e.  The 
proof  of  this  uses  the  fact  that  SA(t)x  is  an  integral  solution  of  (E)0  for 
x e D(A).  Moreover,  SA(t)x  is  Lipschitz  continuous  in  t if  x e D(A)  (the 
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proof  of  Theorem  1 shows  this).  Thus  if  X has  the  property  that  Lipschitz 
continuous  functions  of  a real  variable  with  values  in  X are  differentiable 
a.e.,  then  S^rjx  is  a strong  solution  of  (E)0  for  x e D(A),  and  a weak 
solution  for  x e D(A).  Reflexive  spaces  have  this  property.  However,  there 
are  examples  in  which  ,S^(f)x  is  nowhere  differentiable  in  t > 0 for  any 
xe  D(A).  The  question  thus  arises  of  whether  or  not  integral  solutions  of 
(E)0,  or  more  generally  (E)/5  are  uniquely  determined  by  Cauchy  data. 
The  answer  is  given  by  the  following  result  of  Benilan  [3],  which  we  state 
informally  here  and  prove  in  Appendix  2:  If  u e C(.^,  A")  is  obtained  as  the 
uniform  limit  of  solutions  of  backward  difference  schemes  converging  to 
(E )f,  then  (2.8)  holds  for  every  integral  solution  v of  (E)g  on  Moreover, 
note  that  the  set  of  (u,/),  (v,  g)eC( <P,  X)  x L’oc(.^,  X)  satisfying  (2.8)  is 
closed.  Setting  /=  g in  (2.8),  we  see  that  if  u is  obtained  as  the  limit  of 
solutions  of  backward  difference  schemes  converging  to  (E)f,  then  integral 
solutions  of  (E)^  with  the  initial  value  u(0)  coincide  with  u on  t > 0.  In 
particular,  SA(t)x  = u(t)  is  the  unique  integral  solution  of  (E)0  on  t > 0 
satisfying  u(0)  = x.  Since  strong  solutions  are  integral  solutions,  if  this 
problem  has  a strong  solution,  it  must  coincide  with  S A(t)x. 

The  existence  result  Theorem  1 can  be  used  to  obtain  solutions  of  (E);. 
To  this  end,  assume  for  simplicity  that  A is  m-accretive,  i.e.,  A is  accretive 
and  R(l  + kA)  = X for  l > 0.  Let  T > 0,  x e D(A),  and  /be  a step  function 
on  [0,  T),  f(t)  = y,  on  n,_ , < t < a, , where  0 = a0  < ax  < a2  < ■ • ' < u„  = T. 
Then  u + Au  sf  is  the  same  as  u'  + Atu  aO  on  a,_ , < t < aif  where 
Atx  = Ax  — y,.  Now 

fSAi(/  - ci,  _,)«(«,_,),  «,_!</<«,,  / = 1,  2, ...,  n, 

u(t)  = 

|x,  t = 0, 

is  a solution  of  (E)f,  u( 0)  = x obtained  as  the  uniform  limit  of  backwards 
difference  schemes  converging  to  (E)f . Thus,  if/  and  g are  two  step  functions 
on  [0,  T]  and  uf,  ug  are  obtained  as  above,  (2.8)  holds  with  u = uf,  v = ug. 
Setting  v = 0,  this  yields 

|| «/(')  ~ M*(0ll  ^ I [«/(*)  ~ ue(*h  /(*)  - y(a)]+ 

• o 

< j || /(a)  — g( a)||  da,  for  0 < t < T. 

• o 

For  general/ e L'([0,  T],  A),  the  above  considerations  imply  that  whenever 
{/„}  is  a sequence  of  step  functions  converging  to  / in  L'([0,  T],  X)  then 
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u(t)  = limn  u,  (l ) exists  and  is  the  unique  integral  solution  of  (E)y  on  [0,  7 ] 
satisfying  u(0)  = x. 


3.  Examples 

Demonstrating  that  a given  evolution  problem  in  partial  differential 
equations  falls  within  the  scope  of  Theorem  1 via  a reduction  of  the  type 
outlined  in  Section  1 can  be  quite  involved.  It  usually  requires  tools  from 
the  technical  machinery  of  partial  differential  equations.  Moreover,  one 
encounters  extra  subtleties  when  the  problem  is  set  in  a nonretlexive  space, 
and  we  restrict  our  attention  to  this  case. 

An  exception  to  these  comments  arises  when  there  is  only  one  space 
variable.  Here  the  semigroup  approach  can  be  rather  striking,  for  by  it  the 
study  of  a problem  in  partial  differential  equations  is  reduced  to  the  study 
of  an  ordinary  differential  equation.  Since  we  do  not  assume  that  the  reader 
is  familiar  with  Sobolev  spaces  and  so  on,  the  examples  below  are  given 
for  the  case  of  one  space  variable  and  we  shamelessly  make  strong 
unnecessary  assumptions  to  simplify  the  proofs.  These  examples  give  a hint 
of  the  scope  of  the  abstract  theory  and  how  to  show  a given  problem  may 
be  subsumed  by  it. 

The  reader  with  experience  in  partial  differential  equations  may  prefer 
to  go  directly  to  the  literature  cited  in  Section  6.  While  this  literature 
is  steadily  increasing,  we  want  to  emphasize  that  the  study  of  applications 
is  just  beginning.  We  not  only  expect  many  new  applications  to  be  found, 
but  also  refined  results  for  known  applications. 

Example  1.  A quasilinear  hyperbolic  equation.  Consider  the 
problem 


(CL) 

u,  + (0(h)),  = 0, 

t > 0,  0 < x < 1 , 

(IC) 

«(0,  x)  = w0(x). 

0 < x < I, 

(3.1) 

(BC) 

u(f,  0)  = 0, 

t >0, 

consisting  of  the  conservation  law  (CL),  the  initial  condition  (IC).  and  the 
boundary  condition  (BC).  In  (3. 1 ),  0:  K -*  R.  To  have  a very  simple  presenta- 
tion, we  assume  0 is  continuous,  strictly  increasing.  0(0)  = 0,  and  0(R)  - IK. 
Following  the  outline  of  Section  1,  we  set  X = ZJ([0,  1]).  D(A)  = 
Jm  e C([0.  1 ]) : m(0)  = 0 and  0(m)  is  absolutely  continuous),  and  Au  = (0(u))' 
for  u(  D[A)  ( w ' denotes  the  derivative  of  w).  Thus  A acts  in  accordance 
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with  (CL)  and  D(A)  consists  only  of  functions  satisfying  (BC).  We  claim  A 
is  m-accretive  in  L'([0,  1]). 

First  we  show  that  A is  accretive.  Let  u,  v e D( A)  and  A > 0.  Let  p:U-*U 
be  Lipschitz  continuous  and  satisfy 

P is  nondecreasing,  |p|  < 1,  and  p(0)  = 0.  (3.2) 

If  ./'(•'■)  = Jo  P(t)  dx,  then 

| (An  - Av)p((t>{u)  - 0(f))  dx  = | (0(u)  - (f>(v))' p((p(u)  - 0(f))  dx 
• 0 -o 

= I ./#(“)  - 0(f))'  dx 
• 0 

= /(0(u(l))-0(r(l)))>  0,  (3.3) 

since  0(i/( 0))  = 0(p( 0))  and  / > 0.  Also, 

(u  - v)p((f)(u)  - (p(v))  = \u  ~ rj  | p(4>(u)  - 0(p))|, 
since  p and  (j)  are  increasing.  Thus,  by  (3.2)  and  (3.3) 

,i  ,i 

) | u — v + /.(An  — Ar) | dx  > | ((u  — v)p((p(u)  — 0(r)) 

• o -o 

+ /.(An  - Av)(p((f>(u)  - 0( t’))))  dx 

>|  |u  - 17 1 I p(4>(u)  - 0(f))  | dx.  (3.4) 

• 0 

Finally,  let  p — pn  in  (3.4),  where 

Pn  S jsign.v  if  |s|  > n~  *, 

and  then  let  n -+  oo.  Since  (u  — v )p„(0(w)  — 0(f))  ->•  |m  — v |,  (3.4)  then  yields 
|| u — v + A(Au  — Av)\\i  > ||m  — f |j ! [where  |j  ||p  is  the  norm  in  Lp([0,  I]), 

1 < p < oc].  Thus  A is  accretive. 

It  remains  to  show  R(l  + AA)  = C([0,  1])  and  it  suffices  to  assume  A = 1. 
Given  h e L‘([0,  1])  we  seek  u e D(A)  such  that  u + 0(t<)'  = h.  If  /I  = 0_  1 
and  v = 0(tt)  the  problem  becomes  (i(v)  + v'  — h , v(0)  = 0,  and  v is  absolutely 
continuous.  By  standard  continuation  arguments  of  ordinary  differential 
equations,  it  suffices  to  prove  that  if  v is  absolutely  continuous  on  [0,  a],  a > 0, 
«?(0)  = 0,  and  (i(v)  — v'  = /?  a.e.,  then 

| \ft(v)\  dx  < I \h\dx, 

• 0 -o 
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for  this  implies 

\v(x)\  < | |i/(s)|  ds  = | \h(s)  - fi(v(s))\  ds  < 2 | \h(s)\  ds 
•'  0 ‘0  -o 

for  0 < x < a.  However,  this  follows  as  above.  A is  accretive,  so  if  u = (i(v) 

| | u | dx  < I |()  + AO  — (u  + Au)\  dx  = j \h  \ dx. 

• o -o  -o 

[In  the  proof  that  A is  accretive,  we  could  have  used  L’([0,  a])].  The  sketch 
is  complete. 

Thus  (3.1)  may  be  reduced  to  the  form  (ACP)  with  an  /w-accretive  A, 
and  (3.1)  therefore  has  a unique  integral  solution  defined  by  this  A for  all 
u0  g D(A  ).  One  can  show  that  D(A  ) = L'  ([0,  1 ])  here.  This  type  of  example  is 
especially  interesting.  It  generalizes  (with  different  proofs)  dramatically.  In 
particular,  one  can  treat  problems  of  the  form 

* d 

WI  + X - (4>i(w))  = f°r  1 > 0’  X G UN, 

i = 1 ( X, 

where  0:  (R  -»  UN  need  only  be  continuous  and  satisfy 

lim  |0(r)|/[r  |<v_  l,/S  = 0. 

r-0 

The  Cauchy  problem  for  this  equation  does  not  admit  continuous  solutions 
even  if  cp  an<J  the  data  are  smooth  [nor  does  (3.1)].  This  equation  is 
interpreted  as  a family  of  integral  inequalities  by  Kruzkov  [43J.  Here  we 
see  that  the  difficulties  associated  with  understanding  SJt)x  as  a solution 
of  (ACP)  appear  in  concrete  examples.  See  the  works  cited  in  Section  6 
for  more  information. 

Example  2.  A quasilinear  parabolic  problem.  Consider  the 
problem 

u,  — (<p(u))xx  = 0,  t > 0,  0 < x < 1, 

u( 0,  x)  = u0(x)’  0 < x < 1,  (3.5) 

u(t,  0)  = u(t,  1 ) = 0,  t > 0. 

Again,  let  X = L*([0,  1]),  0:  IR  -►  1R  be  continuous,  strictly  increasing, 
0(0)  = 0,  and  0(R)  = R.  Set  D(A ) = {u  e C([0,  1]):  m(0)  = u(  1)  = 0,  4>(u)  and 
0(u)  are  absolutely  continuous},  and  Au  = — (p(u)"  for  u g D{A ).  If  p satisfies 
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(3.2),  then 

| (Au  - Av)p(4>(u)  - 0(i))  dx  = | p'((p(u)  - 4>(v))(((p(u)  - M v))')2  dx  > 0. 

• o -o 

(3.6) 

The  fact  that  A is  accretive  follows  from  (3.6)  as  it  did  from  (3.3)  in  the 
first  example. 

Let  h e L'([0,  1]).  To  solve  u + Au  = h we  set  v = 0(u),  (i  = (p~ 1 as  before 
and  solve  instead  /f(i)  — v"  = h,  v(0)  = l(1)  = 0. 

Step  I.  Assume  fi  is  bounded,  \(i(v)\  < K for  ve  IR.  Let 

Tv  = \ lg(x,  yMv(y))  - h(y))  dy  for  t;  e L‘([0,  1]), 

• o 

where  y{x,  y)  = >'(x  - 1),  0 < y < x < 1,  and  g(x,  y)  = c/(y,  x),  0 < x,  y < 1. 
That  is,  w = Tv  is  the  solution  of  w"  = fi(v)  — h,  w( 0)  = w(l)  = 0.  A trivial 
estimate  shows 

||Tt,|u  < ||«„)  - Ml.  £ ««■>)!! i + 11*11,  < k + 11*11,. 

and 

II(MIUsIW»)-*II.sk  + ||*I|,. 

Moreover,  T:  L‘([(),  1])-»L'([0,  1])  is  continuous.  It  follows  that  T is  a 
continuous  mapping  of  L!([0,  1])  into  the  compact  convex  subset 
{w  e L'([0,  1]):  w is  absolutely  continuous  and  ||>v||x,  ||vv'||x  < K + ||/i||,}. 
(The  compactness  follows  from  Ar/.ela  Ascoli.)  T has  a fixed  point  by 
Schauder’s  fixed-point  theorem,  and  a fixed  point  of  T is  a solution  of  our 
problem. 

Step  2.  Since  A is  accretive,  if  /i(v)  e D(A)  and  fi(v)  — v"  = h,  then 
/Mill  < \\P(v)  - v"  - (0(0)  - 0'')||,  = ||#i||a. 

Moreover,  since  r(0)  = t(  1 ) = 0,  t'(£)  = 0 for  some  £ e (0,  1 ),  and 

M*)|  < I v"(s)ds  <|  \fi{v)  - h\  dx  < 2(|/i(( ^ 

-o 

|p(x)|  < I v'(s)  ds  <1  \v'{s)\  ds  < 2||/i||, 

J0  *0 


Finally, 
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for  x e [0,  1].  For  unbounded  /(,  set 

| 2||/i|j j,  if  P(s)>  2||f.|lt 

fi(s)=  Pis),  if  \P(s)\<2ih‘l, 

1-21*11,,  if  P(s)  < —2  h x. 

Now  ft  is  bounded  and  the  solution  v of  Ji(v)  - v"  = h,  v(0)  = t ( 1 ) = 0 (which 
exists  by  Step  1)  satisfies  ||t;||x  < 2 h , (as  above).  But  then  ft(v)  = P(v) 
and  v is  the  desired  solution. 

The  study  of  quasilinear  and  semilinear  parabolic  equations  that 
correspond  to  m-accretive  operators  is  the  most  highly  developed  area  of 
applications.  See  Section  6 for  references. 


Example  3.  A nonlinear  operator  accretive  in  C([0,  1]).  This  final 
example  is  related  to  the  preceding  one.  However,  the  underlying  space 
will  be  C([0,  1])  and  this  time  A can  be  multivalued.  Consider  the  problem 

— uxx  = 0,  t > 0,  0 < x < 1, 

u(0,  x)  = u0(x),  0 < x < 1,  (3.7) 

u(f,  0)  = u(t,  1)  = 0,  t > 0, 

where  (f>:  IR  -*  R is  continuous,  nondecreasing,  and  0 = 0(0).  Note  that  we 
do  not  assume  0 is  strictly  increasing  or  0(R)  = R.  To  define  the  A associated 
with  '3.7),  set 

X = C([0,  1]), 

D(A ) = {u  e C[0,  1 ] : u,  u',  u"  are  continuous  and  u(0)  = u(  1 ) = 0), 

and 

Au  = (w  e C([0,  1]) : 0(-w)  = u"\  for  ueD(A). 

The  reader  should  ponder  the  relationship  of  (3.7)  to  this  A for  a moment. 

To  show  A is  accretive,  let  u„  u2eD(A ) and  w,  e Auu  w2€/lu2.  Let 
m,  # w2  and 

“i  - n2 1| x = max  | w,(x)  - u2(x)|  = u,(x0)  - u2(x0). 

0<iSl 


(3.8) 
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Since  u,  = u2  = 0 at  x = 0,  1 we  have  0 < x0  < 1.  If  w2(x0)  < Wj(x0)  and 
/.  > 0,  then 

max  |u,(x)  - u2(x)  + /.(w,(x)  - w2(x))| 

0<x<,  1 

> (M^o)  - M^o))  + A(w,  (jc0>  - W'2(^o)) 

> U j (x0)  - u2(x0)  = | u,  - u2  x, 

and  we  are  done.  If  w2(x  o)  > (xo)>  the  monotonicity  of  4>  implies 

0(-W2(x))<  cp(  — w,(x))  on  some  interval  x0  — 6 < x < x0  + <5,  where 
6 > 0.  Then 

K - u2)"(x)  = tp(  w j (x))  - (f)(-w2(x))>  0 

on  this  interval.  Hence  u,  — u2  is  convex  on  x0  — S < x < x0  + 6 and 
assumes  its  maximum  at  the  interior  point  x0.  Then  u,  — u2  is  constant  on 
x0  — 6 < x < x0  + d.  If  x0  is  the  least  number  in  [0,  1]  for  which  (3.8)  holds, 
this  cannot  be.  With  this  choice  of  x0,  w,(x0)  > w2(x0),  and  we  have  now 
shown  that  A is  accretive. 

To  show,  e g.,  R(l  + A)  = C([0,  1]),  choose  h e C([0,  1])  and  solve 
h e (I  + A)u  or  h — u e Au.  This  means  u(0)  = u(l ) = 0 and  u"  = 4>(u  — h). 
This  problem  can  be  solved  in  a fashion  similar  to  that  of  Example  2. 
Since  A is  accretive  and  0 e .40,  if  u is  a solution  then  u||x  < ||/i||x. 
Truncate  (p  by  setting 

| 2||/i|U  if  (p(s)  > 2||/i||x , 

<Rs)  = <p(s)  if  \<t>(s)\  ^ 2 

|-2||/i|U  if  |#s)l  < -2||fc||«. 

Let  Tv  be  the  solution  w of  w"  = (f)(v  — h),  w(0)  = w(l)  = 0.  T is  continuous 

and  maps  C’([0,  1])  into  a compact  convex  subset.  It  has  a fixed  point, 
which  is  the  solution  of  h e (/  + A )u. 

Now  we  must  be  cautious.  While  it  is  true  that  A is  m-accretive,  consider 
the  case  <p(s)  = 0 for  all  s e R.  In  this  case 

\X  if  u = 0, 

Au=\t 

\<p  if  u / 0, 

and  D(A)  = {0}.  Of  course,  if  <p  = 0 one  can  only  solve  (3.7)  for  u0  = 0,  in 
which  case  u = 0.  To  know  for  which  u0  S jt)u0  is  defined  we  must  know 
[)( A ).  It  is  not  too  hard  to  show  D(A)  = \u  e C([0,  1]) : u(0)  = u(l)  = 0}  iff 
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0(K)  = IR.  The  reader  may  consider  other  cases  or  refer  to  the  literature 
cited  in  Section  6.  If  the  fact  that  we  omitted  mention  of  D(A  ) in  Example  2 
was  not  noticed  before,  we  reassure  the  reader  that  D(A)  = L‘([0,  1])  under 
the  hypotheses  of  that  example. 


4.  Auxiliary  Results:  Perturbation  and  Continuous  Dependence 

The  abstract  theory  of  Section  2 is  useful  in  applications  in  that  it  provides 
existence  and  uniqueness  theorems  for,  e.g..  the  examples  of  Section  3 as 
well  as  a notion  of  solution  for  certain  problems  that  do  not  admit 
differentiable  solutions.  Frequently  questions  of  existence  and  uniqueness 
may  be  handled  by  other  means  and  one  may  ask  why  he  should  bother 
with  the  abstract  theory.  There  are  several  answers  to  this.  The  abstract 
theory  provides  insight  into  the  concrete  problems  to  which  it  applies  and 
exhibits  common  features  of  problems  that  otherwise  appear  to  be  totally 
unrelated.  Another  benefit  of  casting  a concrete  problem  into  a form  to  which 
the  general  theory  applies  is  that  one  can  then  make  use  of  the  auxiliary 
results  of  the  theory.  Chief  among  these  are  the  convergence  theorems 
(which  deal  with  questions  of  continuous  dependence)  and  the  perturbation 
theorems.  Examples  of  these  types  of  results  are  stated  here. 

Convergence  theorems  deal  with  the  dependence  of  solutions  of 

(du/dt)  + Au  9/  (E)  e/ 

on  A and  f.  Given  a sequence  {A„}*sl  of  accretive  operators,  we  want  to 
define  a notion  of  convergence  A„  ->  A.  Recalling  the  considerations  of 
Section  1,  a natural  idea  is  to  use  the  resolvents  (/  + /.An)~l  (there  are 
other  ways).  For  simplicity,  assume  each  An  is  w-accretive.  Then  we  say 
A„  -*  A if  A is  m-accretive  and  (/  + /.An)~  ‘.x  -» (/  + kA)~  *.x  for  each  A > 0, 
xeX.  A typical  example  of  a convergence  theorem  says  the  following: 
Let  {A„)*=0  be  a sequence  of  m-accretive  operators  An-*A0,  {/„}*=0  a 
sequence  in  ^([0,  7],  X),f„-*f0  in  L'([0,  7],  *),  x„eD(A„ ),  and  ,x„  -► 
.x0  e D(A0).  If  un  is  the  unique  integral  solution  of  (E)^  uB(0)  = xn,  then 

lim„  un  = u0  uniformly  on  [0,  7], 

In  applications,  convergence  theorems  supply  information  about  the 
continuous  dependence  of  solutions  of  partial  differential  equations  on  the 
equations.  An  interesting  instance  of  this  type  of  result  is  given  by  Kurtz  [44], 
who  showed  that  solutions  of  the  scaled  Carleman  equations 

u,  + ocux  + a2(u2  — v2)  — 0,  v,  — vix  + v2(r2  — u2)  = 0, 
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with 

11  It  = o = k’  |i  = o >0, 

yield,  in  the  limit  as  * -»  x,  solutions  of  w,  - £(log  w)XJ£  = 0.  (This  does  not 
exactly  fit  into  the  framework  given  above.  See  [44]  ) 

Perturbation  theorems  supply  information  about  the  perturbed  problem 

(du/dt)  + Au  + Bu  3 f (PP) 

when  enough  is  known  about  the  unperturbed  problem  (E )A  f and  B.  For 
example,  if  A is  m-accretive  and  B:  X -*  X may  be  written  as  B = fl0  + B1( 
where  B0  is  continuous  and  accretive  and  Bl  Lipschitz  continuous,  then  for 
x e D(a)J  e L’([0,  T].X),  there  is  a unique  u e C([0.  T],  X)  satisfying  u(0)  = x 
such  that  u is  an  integral  solution  of  u + Au  3 f — Bu  on  [0,  T\  This  result 
can  be  used  to  show  that  A + B is  m-accretive  whenever  A is  m-accretive 
and  B is  continuous  and  accretive. 

For  example,  the  operator 

Au  = — Au  on  D(  4)  = [u  e BU(R^) : Au  e BU(R'')\ 

is  m-accretive  in  X - BU(US ) (bounded  uniformly  continuous  functions  on 
M").  Now  it  is  simple  to  see  that  u -*  /i(u)  is  continuous  and  accretive  in 
X if  (i\  R -»  R is  continuous  and  nondecreasing,  while  u -» y(u)  is  Lipschitz 
continuous  in  X if  y:  R -*  R is  Lipschitz  continuous.  Thus,  by  the  perturba- 
tion theorem,  we  can  solve,  e.g., 

(du/dt)  — Au  + eu  + sin  u = f(t,  x),  u(0,  x)  = u0(x)^ 

for  u0  6 X and  suitable  J,  simply  from  the  knowledge  that  A is  m-accretive 
in  X 


5.  Tangency  Conditions 

Let  C be  a closed  subset  of  a Banach  space  X and  B:  C -*  X be 
continuous.  In  order  that  the  problem 

(du/dt)  + Bu  = 0,  u(0)  = x e C,  (5. 1 ) 

have  a continuously  differentiable  solution  u:  [0,  T]  -»  C on  some  interval 
[0,  T],  T > 0,  B clearly  must  satisfy 

lim  sup  d(x  — /. Bx , C)/k  = 0 

n i o 


for  x e C, 


(5.2) 
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where  il( z,  C)  = inf  [ z — y : y e C}.  This  is  because  then  u(t)  = x - tBx 
+ o(t)e  ( as  l l 0.  Brezisand  Bourguignon  [9]  show  that  the  Euler  equations 
of  hydrodynamics  may  be  cast  in  the  form  (5.1).  where  B is  locally  Lipschitz 
continuous  and  satisfies  (5.2).  It  follows  from  the  more  general  results  of 
Martin  [46]  that  under  these  conditions  (5.1 ) has  a solution  u:  [0,  7 ] -*  C 
for  some  T > 0 prov  ided  x € C.  The  main  point  here  is  that  B is  only  defined 
on  a closed  set  C and  C may  have  empty  interior.  (Of  course,  B may  be 
extended  to  a continuous  function  defined  on  a neighborhood  of  C.  but  the 
extension  process  cannot  preserve  Lipschitz  continuity  or  accretive 
properties  of  B in  general.) 

We  wish  to  make  several  points.  First,  there  is  the  unexpected 
correspondence  between  ordinary  differential  equations  of  the  form  (5. 1 ) with 
continuous  B satisfying  (5.2)  and  a difficult  problem  in  partial  differential 
equations.  This  is  in  contrast  to  the  discussion  in  Section  1.  Note,  however, 
the  problem  of  "small  domain"  persists,  albeit  in  another  guise. 

Second,  although  this  setting  appears  quite  different  from  that  of  the 
previous  section  it  is  in  fact  closely  related.  To  see  this,  note  that  if  .x  € C 
and  (5.2)  holds,  then  for  t:  > 0 there  exists  .x;  in  C and  ey  in  X defined  for 
small  /.  > 0 such  that  ,x  — xfl.x  = xx  4-  where  ejt  < Then 

I!*  - xA\\  < /(  Bx  + f;), 

and 

.xA  + /.Bx.  = x + /-(B.x^  — Bx)  + Ae-  = x + <>(/.), 
as  /.  I 0.  Thus  .x^  -»  x and 


X ! — X 


+ B.X; 


B.X;  - Bx 


+ c < 2c, 


(5.3) 


provided  /.  is  small  enough.  In  [47]  Martin  builds  approximate  solutions 
to  (5.1)  by  starting  at  .x,  choosing  A small  enough  to  satisfy  various 
restrictions,  interpolating  linearly  between  x and  x^  on  [0,  /],  and  then 
repeating  the  process.  This  is  equivalent  to  solving  approximately  a back- 
wards difference  scheme  as  indicated  by  (5.3).  To  make  this  clearer,  consider 
the  case  C = X so  (5.2)  holds  for  all  B:  X -*  X.  To  construct  a polygonal 
approximate  solution  to  (5.1)  with  nodes  at  0 = t0  < t,  <•••</*  we  set 
y,  = r,  — solve  the  forwaid  (explicit)  difference  scheme 

X|  ~ Xi_  1 + Bx,_  , = 0,  i = i,  2 /V,  x0  = x,  (FS) 
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and  then  interpolate  the  values  x,  at  r,  piecewise  linearly.  This  yields  the 
familiar  expression 

u(t)  = x,  — (t  — f,)Bx,,  tt  < t < tl+l, 

of  polygonal  approximation.  The  solution  of  (FS)  is  x,  = (/  - y , B)x,_  j,  so 
Bx,  above  is  y,V j(x,  - x,  + l).  Observe  now  that,  using  (FS),  we  have 

V‘  " X‘-  1 + fix,  = Ax,  - Bx,_  , = (B((/  - 7,  Bk- 1 ) - «*,-  i ), 

/ i 

so  that  {X|}  approximately  solves  the  implicit  scheme  y,-  1 (x,  — x,  _ , ) 
+ Bx  j = 0.  (The  rth  error  tends  to  zero  with  y,  if  B is  continuous.) 

The  fact  that  (5.2)  is  equivalent  to  being  able  to  solve 

x;  + ABxy  = x + o(A)  and  jx  - xj  -» 0 as  A j 0 (5.4) 

for  X;  e C was  observed  independently  by  the  author  and  J.  Yorke.  Construc- 
tion of  approximate  solutions  to  (5.1)  under  (5.2)  via  the  explicit  scheme 
is  carried  out  in  Martin  [46]  and  announced  for  the  implicit  scheme  by 
Yorke  and  Kaplan  [30]  (in  a setting  general  enough  to  include  the  case  of 
Theorem  1 of  Section  2 as  well).  Now  it  is  a happy  fact  that  the  convergence 
of  solutions  of  implicit  schemes  converging  to  (5.1)  can  be  proved  by 
generalizing  the  proof  of  Theorem  1 a bit.  This  was  announced  by  Yorke 
and  Kaplan  [30].  A rather  sharp  form  of  the  principal  estimate  is  given  in 
Appendix  1.  Martin’s  convergence  proof  in  [46]  was  difficult,  so  it  is  nice  to 
have  Theorem  1 and  (5.1).  (5.2)  fit  in  the  same  framework.  See  also  the 
comments  on  Appendix  1 in  Section  6. 

Before  we  leave  this  topic,  we  obtain  another  consequence  of  (5.4).  Let 
x,  , xu  e C,  i = 1,2,  satisfy 

xu  + ABxu  = x,  + o(/)  as  A [ 0,  5v 

II*,-  “ “>0  as  ;-l°’ 

for  i =l,  2.  Recall  the  functions  [ , ]t  of  Section  2.  B is  accretive  if 
[x,  — x j , Bx,  - Bx2]+  >0  for  x,  eC.  Since  [.]♦£[,]-,»  stronger 
requirement  would  be  [x,  — x2,  Bx,  — Bx2]_  >0.  In  fact,  these  are 
equivalent  if  (5.2)  holds.  Interest  in  this  point  arises  from  the  results  of 
Martin  [46],  in  which  he  makes  assumptions  on  [ , ].  . We  give  the  proof. 
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Observe  that  for  / > 0 

[v,  - x2,  Bxl  - Bx2 ]_2 

= |*i  “ ^2|l  ~ |j(*i  - *2)  - MBx  1 - Bxa)|| 

/ 

= IK*n  - xzx)  + MBxu  - Bx2x)\\  - l\xu  - x2J  + o(k) 

/ 

*=  [*n  - *22’  Bxlx  - Bx2x]x  + o(1) 

> [*1A  - *2A ' Bxlx  - Bx2x]+  + o(l) 

as  / | 0.  The  relations  (5.5).  continuity  of  B,  and  (2.2)  have  been  used.  If 
(.v  1 x - x2x,Bxlx  — #.v2J+  > (>(xlx  - x2x),  where  p is  lower  semicontinuous, 
letting/,  j 0 yields  [v,  — v2.  fix,  - fix2]_  > p(xx  — x2).  In  particular,  if  B is 
continuous,  accretive,  and  (5.2)  holds,  then  [x,  - x2,  Bxx  — flx2]_  > 0 for 
x,  e This  is  a new  result. 


6.  References  and  Comments 

General  References.  This  chapter  can  be  regarded  as  an  extended 
introduction  to  the  paper  of  Benilan  [4],  With  the  main  exception  of 
Theorem  1,  most  of  the  abstract  results  mentioned  above  and  below  are 
proved  in  greater  generality  in  [4],  However,  the  emphasis  on  generality 
in  [4]  may  not  appeal  to  the  reader  at  this  stage  and  we  list  some  other 
sources  one  could  sample  first.  Brezis’s  excellent  monograph  [11]  is  useful 
for  background  material  as  well  as  the  highly  structured  Hilbert  space 
theory.  It  also  contains  a useful  commentary  and  a good  bibliography  up 
to  1971. 

The  manuscript  “Geometric  theory  of  semi  linear  parabolic  equations” 
(which  is  in  preprint  form)  by  Daniel  Henry,  University  of  Kentucky,  also 
deserves  a mention  here.  It  is  not  directly  concerned  with  accretive  operators, 
but  it  is  for  the  class  of  partial  differential  equations  treated  in  this  work 
that  one  finds  the  most  striking  applications  of  the  ideas  of  ordinary 
differential  equations  and  dynamical  systems. 

Our  references  are  selective.  Questions  of  priority  are  ignored  and  papers 
subsumed  by  later  works  (like  [4])  are  often  not  mentioned.  However, 
papers  not  cited  here  will  usually  appear  in  the  bibliography  of  either  [11] 
or  some  paper  we  do  quote. 
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Section  1.  This  section  is  a bit  misleading  in  that  the  theory  presented 
here  was  developed  internally  rather  than  with  applications  in  mind.  The 
original  goal  was  to  see  to  what  extent  the  linear  theory  of  semigroups 
and  the  Hille  Yosida  theorem  could  be  carried  over  to  nonlinear  cases. 
It  has  not  been  necessary  to  refer  to  the  linear  theory  before  now,  and  this 
theory  is  not  a prerequisite  for  understanding  the  nonlinear  case. 


Section  2 

Added  in  proof:  Recent  generalizations  of  Theorem  1 are  referenced  in 

the  comments  on  Appendix  1 in  this  section. 

On  multivaluedness.  We  made  no  real  attempt  here  to  convince  the 
reader  that  A : X -*  2X  is  a useful  generality.  One  example  is  given  in 
Section  3.  The  case  is  clearest  in  Hilbert  spaces.  See,  e.g.,  the  examples  in 
[12].  A trivial  but  significant  example  in  R is  the  operator 

|{1},  x>0, 

sign:  x ->  ' [-  1,  1],  x = 0, 

x < 0. 

The  theory  is  not  simpler  and  the  proofs  are  not  simpler  if  one  assumes  that 
A is  a function. 

On  [x,  >’]±-  The  functions  [x,  y]±  used  here  are  one-sided  directional 
derivatives  of  the  norm  j|x||.  In  many  papers  5||x||2  and  its  derivatives 
appear  instead,  with  the  corresponding  statements  suitably  modified.  (This 
is  due  to  the  fact  that  ||  ||2  is  nice  in  Hilbert  spaces.)  In  fact,  one  can  use 
p 1 1| x || p and  its  derivatives  ||x|jp-1[x,  y]±  for  1 < p < oc  and  develop  an 
equivalent  set  of  statements.  However,  it  is  now  clear  that  jj  ||  is  by  far  the 
most  convenient  in  general  Banach  spaces. 

There  is  another  description  of  [x,  y]+  . One  has 

[x,  y]+  = max  {x*(>') : x*  e X *,  ||x*||  < 1,  and  x*(x)  = ||x || }, 

where  we  regard  X as  a real  Banach  space.  Notation  (especially  the  author’s) 
varies  wildly  also  and  each  paper  must  be  checked  for  its  conventions. 

The  reader  should  compute  [ , ]±  for  the  cases  where  X is  a Hilbert  space, 
X = IS([ 0,  1]),  1 < p < oo,  and  X = C([0,  1])  to  orient  himself.  See  also 
Sato  [58], 
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Another  fact,  developed  in  [4],  is  that  one  can  use  more  general  sublinear 
functions  in  place  of  jj. 

Assume  now  that  A satisfies  the  hypotheses  of  Theorem  1. 

Exponential  formulas.  It  is  the  influence  of  the  linear  theory  that  led  to 
the  emphasis  on  the  "exponential  formula”  S A(t)  = lim„^x  (/  + (r/n)/l)_n 
rather  than  the  equivalent  but  more  fundamental  idea  of  scheme  (BD)  in. 
e g-.  [19]  and  [22].  (BD)  is  mentioned  only  in  passing  in  each  case. 

SA(t  ).v  is  a strong  solution  if  it  is  differentiable.  It  follows  from  Appendix 
2 that  S 4(t)x  is  an  integral  solution  of  (E)0  for  xe  D(A  ).  This  special  case 
of  Theorem  A. 2 is  obtained  more  simply  in  Miyadera  [51,  proof  of  Lemma  1 ]. 
This  is  done  in  the  form  appropriate  to  f 2.  The  proof  can  be  made  very 
short  using  instead.  Given  this  result,  one  easily  shows  u'(t)  + Au(t)  aO 
whenever  u(f)  = S^(f)x  is  differentiable  at  t > 0,  A is  closed,  and  x 6 D(a). 
See  [22,  pp.  277-278], 

If  ii  is  a strong  solution  of  (E)0,  w(0)  x,  then  t/( / ) — SA(t)x.  This  result 
does  not  require  Theorem  A. 2 either.  It  was  proved  in  [14]  that  lim£i0  J[‘ £|+  ‘x 
exists  and  is  equal  to  the  strong  solution  provided  the  latter  exists.  With 
the  convergence  known  in  advance,  the  proof  is  very  simple.  See  [ 19.  p.  162]. 
This  comment  and  the  one  above  justify  defining  S^(f)x  to  be  a solution  of 
(E)0.  However  Benilan's  Theorem  A. 2 unifies  all  results  like  this  and  the 
above  and  permits  a much  more  coherent  development. 

Strong  solutions.  It  is  sometimes  convenient  to  replace  M/1loc1(./.  A')  by 
W ‘^‘(interior  X)  in  this  definition. 

Weak  solutions.  The  term  weak  solution  has  a different  meaning  in  [54] 
(see  also  the  references).  With  our  definition,  it  is  not  known  if  S A(t)x  is  a 
weak  solution  of  (E)0  for  x e D(A). 

Integration  and  differentiation.  It  is  necessary  to  be  familiar  with  the 
theory  of  integration  and  differentiation  of  vector-valued  functions  to 
pursue  this  subject  farther.  See,  e g..  [65]  for  integration  and  the  appendix 
of  [ 1 1 ] for  differentiation. 


Infinitesimal  generators.  The  notion  of  a semigroup  of  contractions  on 
D(A)  was  defined  in  Theorem  1.  If  D(A)  is  replaced  by  an  arbitrary  C ^ X 
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in  this  definition  one  obtains  the  notion  of  a semigroup  of  contractions 
on  C\  The  linear  theory  suggests  the  problem  of  attempting  to  characterize 
the  infinitesimal  generators  of  semigroups  of  contractions  insofar  as  possible. 
Indeed,  the  nonlinear  theory  began  with  such  questions.  See  [19]  for 
definitions  and  a review  of  this  problem,  Miyadera  [52]  for  recent  results 
in  this  direction,  and  [4,  Chapter  IV]  for  another  point  of  view. 


The  equation  ilujdl  -f  A(t)u  3 0.  Results  for  this  equation,  which  is 
formally  more  general  than  the  case  A(t)u  = Au  - f(t),  are  given  in  [23], 
These  results  are  not  developed  in  terms  of  a notion  of  integral  solution. 
L.  C.  Evans  has  (in  a work  in  preparation)  recently  obtained  results  in  the 
setup  of  [23]  sufficiently  general  to  subsume  the  case  A(t)u  = Au  — f(t ) 
with  /(r)  e /J([0,  T\.  X)  directly. 

Note  added  in  proof:  See  L.  C.  Evans,  Nonlinear  evolution  equations. 

Mathematical  Research  Center  TSR  # 1568  (1975),  Univ.  of  Wisconsin, 
Madison,  Wisconsin. 


Seccion  3 

Nonlinear  diffusion.  Results  for  mappings  of  both  of  the  forms  Atu  = 
-Au  + (i(u)  and  A2u  = — A0(u),  where  ji  and  0 are  accretive  operators 
in  (R,  are  obtained  in  Brez.is  and  Strauss  [16],  It  is  a consequence  of  their 
results  that,  under  various  assumptions,  A , gives  rise  to  w-accretive 
operators  in  IF  spaces,  1 < p < oo,  while  the  form  A2  defines  an  w-accretive 
operator  in  L1.  Actually,  A2  is  not  explicitly  mentioned  in  [16].  Note, 
however,  that  the  “change  of  variable”  v = 0(u)  formally  transforms 
u -*  u — k A 0(u)  into  v -*  /i(v ) — /.  At’,  where  /f  = 0 '.  This  may  be  used  to 
make  the  connection.  Observe  that  fi  may  be  multivalued  even  if  0 is  not. 
For  example,  the  Stefan  problem  (see,  e g.,  [ 10])  may  be  written  in  the  form 
u,  — A 0(u)  = 0,  where  0 is  a function  and  0 1 is  not.  The  map  u — A 0(u) 
is  also  accretive  in  //  '.  See  [12]  and  [13]. 

The  equation  u,  — A(u7)  for  various  a arises  in  the  study  of  flow  through 
a porous  medium.  See  the  references  of  Konishi  [40],  Difficulties  not 
treated  in  [16]  arise  when  attempting  to  associate  an  w-accretive  operator 
with  u -»  — A 0(u)  in  L'(IR'v)due  to  the  infinite  measure  of  R*.  These  problems 
are  treated  in  [8],  The  operator  u -+  - A 0(u)  in  one  space  dimension  is 
studied  in  [21  ] when  0 4 0(0)  and  results  of  Kurtz.  [41]  for  0(u)  = log  u are 
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generalized.  One  finds  u,  - Ar/>(u ) studied  under  nonlinear  boundary  con- 
ditions in  [3].  The  problem  (p(ut)  — Au  = 0 is  treated  in  [7]  and  Konishi 
[41],  Many  of  Konishi’s  notes  [33  42]  are  related  to  parabolic  problems. 

Hyperbolic  systems.  Examples  of  nonlinear  hyperbolic  systems  genera- 
ting semigroups  of  contractions  in  L'  spaces  are  given  in  [19]  and  [44], 
These  examples  can  be  greatly  generalized  by  abstraction,  but  this  has  not 
been  written  down  yet. 

Conservation  laws.  An  excellent  introduction  is  given  by  Lax  [45].  The 
simplest  paper  written  from  our  point  of  view  is  [20].  Oharu  [54]  and 
Benilan  [5]  treat  the  technically  more  complex  case  where  <f>  = (p(x,  u ) 
depends  on  x as  well  as  u.  Benilan  also  settles  a question  about  D(A)  left 
open  in  [20].  Kruzkov’s  basic  paper  [43]  provides  an  essential  idea  used 
in  each  of  these  presentations.  See  also  Doughs  [25]. 

Equations  of  Hamilton-Jacobi  type.  See  Aizawa  [1],  Tamburro  [61], 
Fleming  [27],  and  Friedman  [28], 

Optimal  control.  We  know  of  only  one  paper  here,  namely,  Slemrod  [59], 

Functional  equations.  Simple  functional  equations  correspond  to  m- 
accretive  operators  A perturbed  by  Lipschitz  functions.  See  Webb  [64]  and 
Flaschka  and  Leitman  [26], 


Section  4 

Perturbation  theorems.  There  are  only  a few  general  known  results 
valid  in  arbitrary  spaces  X.  There  is  the  theorem  of  Webb  [62]  as 
generalized  by  Barbu  [2]  to  state  that  A + B is  m-accretive  if  A is  m-accretive 
and  B is  continuous  and  accretive.  A simpler  proof  and  other  results  arc 
given  in  Pierre  [56],  See  Martin  [58]  for  another  type  of  generalization  of 
Webb’s  result.  Calvert  and  Gustafson  [ 1 8]  deal  with  another  class  of  perturba- 
tions. Results  influenced  by  a class  of  applications  are  given  in  Benilan  [6] 
See  also  [63]. 

There  are  many  more  true  theorems  in  the  case  where  X is  a Hilbert 
space.  See  [II].  Some  of  the  proofs  of  these  work  if  only  X and  X*  are 
uniformly  convex.  See,  e g.,  Konishi  [35]. 
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Convergence  and  approximation  theorems.  There  are  many  types  of 
these  proved  in  many  places.  The  one  we  have  stated  is  found  in  [4. 
Chapter  V],  Kurtz’s  presentation  in  [44]  is  especially  interesting  and 
Goldstein  [29]  uses  a similar  idea  in  his  proof.  See  [31,  32]  for  recent 
results  of  a somewhat  different  sort.  Oharu  [55]  develops  approximation 
theorems  while  treating  a pde.  We  also  mention  [15,  52,  and  53], 

Asymptotic  behavior.  The  study  of  asymptotic  behavior  of  solutions 
of  (E)^  is  not  highly  developed.  See  [4,  III. 5]  for  results  in  Hilbert  spaces. 
Dafermos  and  Slemrod  [24]  prove  that  {S  4(t)x  : t > 0}  is  precompact  for 
x e D(A ) under  certain  assumptions  on  A and  then  apply  the  theory  of 
dynamical  systems  to  establish  a number  of  interesting  conclusions.  See 
also  Bruck  [17]. 


Section  5 

The  tangency  condition  (5.2),  with  lim  sup  replaced  by  lim  inf  [let  us  call 
this  (5.2)*]  was  used  already  by  Nagumo  [54],  Martin  [47]  proves  that 
(5.2)*  =>  (5.2)  if  B is  continuous.  Most  of  the  results  concerning  the  con- 
sequences of  (5.2)  use  the  fact  that  one  can  build  approximate  solutions 
to  (5.1)  (or  generalizations)  in  this  case.  In  this  regard,  it  may  be  helpful 
to  read  the  proof  of  [47,  Lemma  2]  before  pursuing  other  papers  on  this 
topic.  This  simple  proof  is  quite  close  to  Nagumo’s,  and  Martin’s  works 
[46-50]  delve  quite  deeply  into  the  implications  of  (5.2)  and  related 
conditions  and  contain  further  references.  See  also  RedhefTer  and  Walter  [57], 


Comments  on  Appendix  1 

After  the  preparation  of  this  manuscript  the  author  obtained  a copy  of 
Takahashi’s  paper  [60].  This  work  contains  more  general  results  concerning 
existence  and  convergence  of  approximate  solutions  of  backwards  difference 
schemes  than  those  previously  known  to  the  author.  Takahashi  uses  argu- 
ments related  to  those  of  Appendix  2 to  show  convergence,  and  his  proofs 
are  more  complex  than  in  our  Appendix  1 (which  is  still  sufficient  for 
known  applications).  In  fact,  however,  it  is  possible  to  extend  and  clarify 
Takahashi’s  results  by  direct  estimation  of  solutions  of  schemes  similar  to 
(A. 9),  which  we  will  do  elsewhere. 

Added  in  proof:  Significant  new  results  were  subsequently  obtained  by 
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Y.  Kobayashi.  Difference  approximation  of  evolution  equations  and  genera- 
tion of  nonlinear  semi-groups,  Proc.  Japan  Acad,  (to  appear),  and  M.  G. 
Crandall  and  L.  C.  Evans,  On  the  relation  of  the  operator  d/ds  -I-  d/dx  to 
evolution  governed  by  accretive  operators,  Israel  J.  Math,  (to  appear).  In 
particular,  Kobayashi  showed  that  S Jt)  can  be  defined  if  only  the  limit 
inferior  of  a-1  d(x,  R{I  + kA))  as  k tends  to  0+  vanishes  for  x in  the 
closure  of  D(A).  A more  general  condition  is  given  by  M.  Pierre.  Un 
theoreme  general  de  generation  de  semi-groups  non  lineaires  (to  appear). 


Appendix  1 

The  result  proved  here  represents  a refinement  of  the  proof  of  Theorem  3 
of  [30].  The  authors  of  [30]  had  kindly  provided  a rough  sketch  of  their 
proof  to  the  author.  (With  respect  to  [30],  it  is  presently  our  opinion  that 
there  is  no  advantage  in  working  with  the  piecewise  linear  functions  they 
use.  One  may  as  well  deal  w-?.h  the  step  functions  directly  as  is  done  here.) 
See  also  the  comments  in  Section  6. 

Let  A be  an  operator  in  X and  / = JO  = .s0  < .sy  < • • • < sN}  be  a partition 
of  [0,  sN\  with  steps  d,  = — ,s,_ ,.  Given  x0  e A",  consider  the  backwards 

difference  scheme 

(x,  — x,  _ j )/<5,  + Axt  s 0,  /'  = 1, . . . , N.  (D)y,  Xo 

This  generalizes  the  scheme  £“  '(u,(t)  — uf.(t  ~ £))  + Au  F(t ) 9 0 of  Section  1, 
which  corresponds  to  s,  = ic,  x,  = u,(w).  However,  here  we  are  willing  to 
tolerate  errors.  A sequence  {x,,  ...,  xNJ  c=  X is  called  a ( K , r\)-approximate 
solution  of  (D)//iJt  if 

(i)  3Zj  e (Xj  - xf_  l)/di  + Axt,  such  that  |(zi|j  < rj< 

for  i = 1 N,  (A.l ) 

(ii)  | x,-  — x.-.J/^j  < K,  for  i=l N. 

Here,  r]  measures  the  error,  while  K is  a Lipschitz  constant  for  the  piecewise 
linear  function  interpolating  between  the  values  x,  at  s, . Let  — {0  = t0  < 
h < " ‘ be  another  partition  with  steps  y j = tj  — fj_  i and  yo  e X.  We 
wish  to  compare  approximate  solutions  of  (D)y  Vo  and  (D)y  ,o.  Let 
Jy, yM}  be  a ( K , ^-approximate  solution  of  (D)y  Vo,  that  is. 


3wj  e (yj  - , )/yj  + Ay, 

such  that 

INI  ^ * 

for  j = 1, . 

. M, 

(A.2) 

yj'hj-yj-iW  z K 

for  j = 1, . 

M, 
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Theorem  A.1.  Let  A be  accretive,  (A.l)  and  (A.2)  hold,  and  <5,  < yf  for 
1 < i < N,  1 <j  < M.  Then 

|x,-  - yj\  < |x0  - y0|  + 2.v, t]  + K(stA  + \tj  - -v,- 12)’  2 

for  1 < i < N,  1 < j < M,  where 

A = max  {-/|  — <5k  : 1 < / < M,  1 < k < N }. 


Proof.  Let  = (/  + A/4)  *.  Then,  by  (A.li)  and  (A.2i) 

= Jrt,(x,_  1 4 <5;  z,),  y,  = J ,(yj-  t + y}  Wj).  (A.3) 

The  relation 


/ U A — U , \ 

J ■ X — J„[  . X + , J Xx 


(A4) 

l A A " / 

holds  for  x e D(J  J,  0 </,/*,  as  may  easily  be  checked  (or  see  [22,  Lemma 
1.2]).  Set  d{j , i)  = || Xj  - yj.  Using  (A.3)  and  (A.4)  we  have 

d(j,  /)  = Jrt,(x,-_ , + St z.)  - J...(y7_  i + y^wj)! 

yj  - % 


= j 1 + ^>Zi)  _ ' (JO-  1 + 7j  Wj)  + 'J  >’j 

\ l j ij 


< 


(Xj-  , + Si  z.)  - ( d'  y,_  ! + d,  Wj  + 1 J d‘  yj 
h 'j 


since  J,,  is  a contraction.  Set 

Oj , , = <5./7j  , Of  i=l-  0j.  i = (}’j  - «5i)/7j  ■ 

Note  0 < Ojt  i , Of , < 1.  Now  (A. 5)  implies 

4/,  0 < Oj'  i||xi_ , - y,_  , | + *llxi-  i - yj  + - wj\ 

< Oj'  id(j-\,i-i)  + Of  i d(j , i - 1)  + 2Si  tj, 

because  Ijz,-)),  ||wj  < rj.  For  notational  convenience,  let 


(A.5) 


(A. 6) 


(A.7) 


.*  = {(*,  l)  :k=  1 M and  / = 1 7V>, 

d.*  = {(fc,  /)  e .^  : / = 0 or  /c  = 0), 

.#°  = 


Also  we  know 

40,  i)  = | x(-  - y0|  < | Xj  - x„|  + |x0  - y0|  ^ Ksi  + 4>,  o , 
d(j,  0)  = \y,-x0\  < \yj-  y0\  + 1*0  - >0 1 ^ Ktj  + 4.0- 
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which  may  be  summarized  by 

d(kj)  < K\tk  - s,\  + d0  0 for  (k,  l)  e d.W.  (A.8) 

The  estimate  will  be  obtained  from  (A. 7)  and  (A.8)  by  a few  simple 
comparisons.  These  are,  for  the  most  part,  obvious  and  we  give  no  formal 
proofs. 

First  step.  Let  a\  -*  R satisfy  a > d on  d.it  and 

a(k,  /)  = 0k  ,a(k  - 1,  / - 1)  + Of l - 1)  on  3f°.  (A. 9) 

Then,  since  2strj  = 0k  ,(2s,_  ,t])  + Of  ,(2s,_  ,k/)  + 2 Stri, 

d(k , /)  < a(k,  /)  + 2(6,  + •••  4-  = a(k , /)  + 2s,  17. 

Second  step.  If  b,c  \ R both  solve  the  same  scheme  (A. 9)  and  c > b2 
on  0.#,  then  h < ^Jc  on  [If  b satisfies  (A. 9),  its  values  are  fixed  convex 

combinations  of  its  values  on  d Then  use  the  Schwartz  inequality.] 

Third  step.  The  solution  c:  ->  R of  (A.9)  with  c(/c,  /)  = \tk  - s,\2  on  d.jf 

satisfies 

\c(k,  /)  - |tk  - s,|2|  < s,A  on  M, 

where  A = max(7  i)e  , (y}  - <5,).  To  obtain  this  result,  set  r(k,  /)  = c(/c,  /) 
— |fk  — s,|2.  By  direct  calculation  [using  (A. 6)] 

r(k,  l)  = 0k  ,r(k  - 1,  / - 1)  + Of  ,r(/c,  / - 1)  + e(k,  l)  on  (A.  10) 

where 

e(k,l)  = Sl(yk-6l).  (A.  11) 

However,  (A.  10)  and  r = 0 on  d.tf  imply 

max  \r(k,  l)\  < max  \r(kj  - 1)|  + max  \e(k,  l)[ 

ISkSM  1 £ J ISM  1 SksM 

which  in  turn  implies 

1 

max  |r(/c, /)|<£  max  \e(kj)\.  (A. 12) 

1 £*£M  j=l  ISksM 

Finally,  (A.  12)  and  (A.  1 1 ) yield  the  estimate 

\r(k,  l)j  < .s,A,  A = max{(y;  - 3,)}.  (A. 13) 

To  complete  the  proof  of  Theorem  A.l,  observe  that  if  b is  the  solution  of 


EVOLUTION  GOVERNED  BY  ACCRETIVE  OPERATORS 


159 


(A. 9)  with  b(k,  l)  = |ft  — .s,|  on  d.tf,  then  a = Kb  + d0  0 is  a solution  of 
(A. 9)  with  a > d on  d.rf  [by  (A.8)].  However,  b < ^Jc , where  c is  as  in  the 
third  step.  Thus 

d(k , I)  < a(k,  l)  + 2s, r/  < Kh(k,  I)  + d0  0 + 2s,r\ 

< K[c(k,  /)]'  2 + d0  o + 2s,  rj 

< K{\tk-  s, | 2 + s'/ A)1  2 + d0  o + 2s,  tj, 
and  the  sketch  of  proof  is  complete. 

In  particular,  if  A is  accretive  and  x e D(A),  then  x,  = J u'x,  yj  = J Jx  are 
( K , 0)-approximate  solutions  of  the  schemes  defined  by  the  partitions 
0 < n < 2/i  < • • • < Nn,  0 < k < 2k  < ■ ■ • < MX,  respectively,  where  K = 

{ ||y||  : ye  Ax)  (for  this  last  fact,  see  [22,  Lemma  1.2]).  Thus 

I \Junx  - J?x  j|  < K((nn  - m/.)2  + n/.(X  - n))v2. 

Compare  this  with  (1.9)  of  [22]  (with  a>  = 0). 

Remark  1.  If  A is  not  accretive  in  Theorem  A.l  but  (1  — Xoj)J , is  a 
contraction  for  A > 0 and  small  with  a fixed  w (equivalently,  A + utl  is 
accretive),  (A. 7)  becomes 

d(j , i)  < UOj. , d(j  — l,i  — 1)  + 0* ,{j,  i - 1 )d(j\  i-l)  + 2S, r,),  (A.  14) 

where  C,  = ( 1 —S,ctj)~l.  To  handle  this,  simply  observe  that  d satisfies 
(A.  14)  iff  h(j,  i)  = cr  ‘cr-1!  • • • Cr  1 4A  0 satisfies 

h(j,  i)<0j,ih(j-\,i-l)  + 0*  ,{j,  i - 1 )h(j,  i - 1 ) + 2£f_1i  • ■ • ■ ■ Cf  '6,  t]- 

(A.  15) 

Assuming,  without  loss  of  generality,  {,■  > 1 for  i = 1 N,  then  h satisfies 

(A. 7)  and  (A.8)  in  place  of  d and  we  therefore  have 

d(k,  l)  < Ci  Udo.o  + 2 s,t]  + [(ik  - s,)2  + s,A)],/2) 

(A.  16) 

Ci  = (1  - S,(jj)  \ i = 1, ...,  k. 

Remark  2.  If  the  partitions  ff  and  & are  not  comparable  in  the  sense 
?>i  < Yj,  then  they  may  be  broken  into  comparable  “ pieces”  and  Theorem  A.  1 
can  be  used  on  each  pair  of  comparable  pieces.  We  do  not  make  this 
precise  here. 

Of  course,  one  can  perturb  these  ideas  in  a number  of  other  ways,  which 
we  leave  to  the  interested  reader.  The  author  is  indebted  to  S.  Parter  and 
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C.  DeBoor  for  a number  of  insightful  observations  about  the  difference 
scheme  (A. 7)  that  contributed  to  our  proof. 

Remark.  Be  sure  to  read  the  comments  on  Appendix  1 in  Section  6. 


Appendix  2.  Benilan’s  Uniqueness  Theorem 

The  proof  sketched  here  is  that  of  Theorem  1.1  of  [3]  (redone  with  ||x|| 
and  [x,  y]  + in  place  of  i||x||2  and  ||x|([x,  y]  + ).  The  generalization  in  [4] 
involves  somewhat  more  complex  statements. 

Let  IP  = (0  = a0  < Uj  < •••  < = T}  be  a partition  of  [0,  T]  with  steps 


Sk  = ak  — ak~ ! and  mesh  g(.'S)  = max  Sk . Let  x0, ... 
elements  of  X satisfying 

, x^  and  j'q  , . 

..,yN  be 

(xk  ~ xt_j)/5k  + Axk  3yk,  k=  1, 

(A.!7) 

and  u, /be  corresponding  step  functions 

u(t)  = xk  for  ak-l<t<ak,  k = 

J(t)  = yk  for  ak-x<t<ak , k = 

1,...,  Af, 

1 

(A.  18) 

When  the  above  holds  we  write  J. 


Theorem  A.2.  Let  f g e L‘([0,  T],  X),  A be  an  accretive  operator  in  X, 
and  v be  an  integral  solution  of  (E)8  on  [0,  T).  If  u e C([0,  T\  X)  and  there 
is  a sequence  {//„}  of  partitions  of  [0,  T ] and  functions  u„.ynJ„  such  that 
i in->u  uniformly  on  [0,  7],  and  ]n-*f  in  L‘([0,  7],  X ),  then: 

HO  - “(Oil  - 1140  - “(01  ^ I [40  - “H  <40  da 

* s 

for  0 < s < t < T. 

Sketch  of  proof.  Since  v is  an  integral  solution  of  (E)g, 

1140  - *11  ~ HO  - x\\  < | [v(x)  - x,  g{ t)  - y]+  dx  (A. 19) 

' 3 

for  y e Ax  and  0 < s < t <T.  Observe  next  that  for  a,  b,  e e X and  A > 0 

[a,  /)  + c]+  < [a,  b]+  + [a,  c]+  < [a,  h]+  + A~'(ll“  + Ac||  - ||a||) 
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as  follows  easily  from  the  definition.  Hence 

[00  - x,  g(x)  - >]+  < [00  - x,  g( t)  - y + w]  + 

+ A~  1 ( d(t)  — x — Aw | — ||00  — x|)  (A. 20) 

for  w e X.  Let  u,J as  above,  set 

x = xk,  y = yk-  Skl(xk-  xk_ j),  A = <5*,  w = ^;'(xt  - xk_,) 
in  (A. 20).  Then  plug  the  resulting  estimate  in  (A.  19)  to  find 

I 

<M||00  - xk  - ||t?(s)  - x*  ) < | (<5fc[c(r)  - x*,  g( t)  - yk]  + 

J S 

+ ||00  - xk_J  - || v(t ) - xk\\)dx. 

By  (A.  18),  the  above  may  be  written 

I (HO  - u(ff)l  - ||00  - u(o’)ll)  do 

* a* -i 

< I | [00  - ii(o),  g( r)  -./»]  + dx  do 
■ «* - 1 * * 

+ | Ht)  ~ «(fl*-2)||  - Ht)  “ “(flk-i)ll  dx • 

J S 

Summing  over  k = m,  m + 1,  n,  we  obtain 

. Cin 

| HO  - “(Oil  - HO  ~ “(Oil  da 

‘ “m  I 

< | | [00  - u(a),  g( x)  ~J(o)\+  dx  da 

' “m  I - ’ 

+ | (HO  - 2)11  - HO  - l)IIMT- 

* s 

Letting  u -»  u uniformly,  a„  -*  (i,  um_  j a,/-»/in  L1,  one  finds  the  inequality 

I (HO  - “(Oil  - 1100  - “(Oil) da 
• 2 

+ I (HO  - “(/0 II  - HO  - “(Oil) 

• s 

< | | [00  - u(a),  g(x)  ~f(o)]+  dx  da, 
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for  0 < a < (i  < T,  0 < s < t < T.  The  second  step  of  the  proof  is  to  show 
the  above  inequality  implies  the  inequality  of  the  theorem.  Letting 

<£(f,  a)  = HO  - u(a)||,  ip(t,  a)  = [»(*)  - u( a),  y(t)  -/( a)]+  , 

for  0 < r,  a < T,  we  have 

| (4>(t,  a)  - (p(s,  a))  da  + | (0(t,  fi)  ~ (p( x,  a))  dx  < ) | a)  dx  da. 

* 3t  * S *2*1 

(A-21) 

Assuming  0,  i/'  are  smooth,  divide  (A.21)  by  (fi  — a)(t  — s)  > 0 and  let 
/j  1 a,  s t t,  to  find 

d(b  . . d(b  . 

Tt  U ^ + Sol  ^ ^ ~ a^' 

Thus  (d/dt)((f)(t,  t ))  < ip(t,  f)  and 

r 

4>(t,  t)  < (p(s,  s)  + | ij/(a,  a)  da  (A.22) 

‘ S 

as  desired.  If  0 is  not  smooth,  let  p e C0  °°(R),  P>0,  p(r)  = 0 if  | z | > 1, 
J p = 1,  and  set 

</>£(t,  a)  = ) | p(z)p(w)(j)(t  — ez,  a — ew)  dz  dw , 

- Iz| ^ i * Ms  l 

a)  = I ) p(z)p(w)ij/(t  — ez,  a — ew)  dz  dw. 

* |z|£  1 •’  |w|S  1 

Then  0£ , ij/r  are  defined  and  smooth  on  e < t,  a < T — e.  Moreover  a direct 
calculation  verifies  that  (A.21)  holds  with  </>,  ij/  replaced  by  </>£,  ij/c.  Hence 
(A. 22)  holds  with  </>£,  [pc  on  e < s,  t < T — e.  The  theorem  results  from 
sending  e to  zero.  Since  d>  is  continuous,  the  convergence  Iim£l0  0£(f,  t)  = 

< p(t , 0 is  uniform  on  compact  subsets  of  (0,  T).  The  passage  to  the  limit  in 
i //£  is  not  quite  so  obvious,  but  in  fact 

I I 

lim  | i/f£(<r,  <t)  drr  < j \f/(a,  a)  da. 

e|0  ‘ z ‘ s 

To  see  this,  observe  that 

ij/(a  — ez,  a — ew)  = [l(<t  — ez)  — m(<t  — ew),  g(a  — ez)  — f (a  — ew)]  + 

< [v(a  - ez)  - u(a  - ew),  g(a)  -f(a)]  + 

+ ||/(<x  - ew)  -f(a)\\  + || g(a  - ez)  - g(a) ||, 
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and  use  the  semicontinuity  of  [ , ]+  together  with  the  continuity  of  u,  v and 
continuity  of  translations  in  L'([0,  7],  X)  [f{o  — ew)->f(a)  in  1}  as  ew  -»0, 
etc.].  Details  are  left  to  the  reader.  (In  the  important  cases  f = g = 0 and 
/ = 0,  g = y,  v = x for  ye  Ax,  f and  g are  continuous.  This  is  a more 
transparent  case.) 
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I.  Introduction:  The  Linear  Case 

Results  on  evolution  equations  in  infinite  dimensions  are  quite  different 
from  the  finite-dimensional  ones.  Even  in  the  linear  case  many  differences 
hold. 

If  £ is  a Banach  space  and  A a linear  operator  from  some  domain  D(A) 
of  E into  £,  can  we  solve  the  problem 

(du/dt)  + Au  = 0 for  t > 0,  u(0)  = u0  s E, 

in  such  a way  that  the  solution  will  be  given  by  a semigroup 

u(t)  = S(t)u0  with  S(t)  e &(E,  £), 

S(t  + s)  = S(t)S(s)  with  5(0)  = /, 

satisfying  the  continuity  condition 

for  all  e e E,  S(t)e  -*  e as  t -*• 0? 

The  answer  is  given  by  a well-known  theorem  of  Hille  and  Yosida  (and 
also  Philipps,  Miyadera,  Feller,  et  ai),  which  says  that  Eq.  (1)  defines  a 
semigroup  satisfying  (2),  (3),  and 

||S(f)|U£,£)<Me"'  for  t > 0,  (4) 

if  and  only  if  the  following  conditions  are  satisfied: 

(i)  D(A)  is  dense  in  £, 

(ii)  XI  + A has  a bounded  inverse  for  X real  and  large 

enough  satisfying  the  estimate  (5) 

II  (XI  + AyWsiE.E)  < M/(X  - Ojf  for  n=  1,  2 


(1) 

(2) 

(3) 
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Let  us  emphasize  the  differences  from  the  finite-dimensional  case  by 
making  some  remarks. 

Remark  1.  In  finite  dimensions  one  always  has  D(A)  = E.  This  is  no 
longer  true  in  infinite  dimensions;  the  case  D(A)  = £ corresponds  to  ordinary 
differential  equations  in  Banach  spaces,  whereas  the  case  D(A)  # £ leads  to 
more  difficult  problems  and,  in  most  concrete  examples,  to  partial  differential 
equations. 

Remark  2.  In  finite  dimensions  one  can  define  the  solution  of  (1)  for 
t > 0 as  well  as  for  t < 0.  In  general,  this  is  not  true  in  infinite  dimensions 
where  the  existence  for  t < 0 is  equivalent  to  condition  (5)  for  — A. 

Example  1.  £ = £2([R),  A — d/dx,  with  D(A ) = //’((R);  then  problem  ( 1 ) 

is  well  posed  for  t > 0 as  well  as  for  t < 0.  In  this  example  condition  (4) 
is  satisfied  for  M = 1 and  to  — 0. 

Example  2.  £ = L2(1R),  A = -d2/dx2,  with  D(A)  = H2(U);  then 
problem  (1)  is  well  posed  for  f>0  but  not  for  t < 0.  Condition  (4)  is 
also  satisfied  with  M = 1 and  oj  — 0. 

Example  3.  £ = £2(IR)  x A{u,  v)  = ( dv/dx , du/dx),  with  D(A ) = 

Hl(U)  x Hl( R);  then  problem  (1)  is  well  posed  for  all  t and  (4)  is  still 
satisfied  with  M — 1 and  to  = 0. 

Remark  3.  Except  for  the  case  where  u0  e D(A),  the  solution  u satisfies 
(1)  in  a generalized  sense  and  u(t)  need  not  be  in  D{A).  However,  for 
u0eD(A),  (1)  can  be  interpreted  in  a classical  sense:  u is  a C1  function 
of  t > 0 with  values  in  £ and  a C°  function  of  f > 0 with  values  in  D(A) 
(which  is  a Banach  space  with  the  norm  \u\\  + ,-lt<  ). 

Some  linear  operators  A have  the  property  that,  even  if  u0  i D(A),  then 
u(t)e  D(A ) for  t >0.  Examples  1 and  3 do  not  possess  this  property  but 
Example  2 does.  Most  of  the  examples  where  this  property  holds  can  be 
stated  in  the  following  form: 

H will  be  a Hilbert  space  (which  plays  the  role  of  £)  and  V another 
Hilbert  space  continuously  embedded  into  H and  dense  in  it.  H being 
identified  with  its  dual,  we  have  V cz  H e V.  Let  A be  linear  and  continuous 
from  V into  V and  satisfy  the  coercivity  condition  ( Au , w)  > a || u |j v2  and 
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0(/l)  will  he  the  set  {u  e V : Au  e //}.  Then  the  semigroup  generated 
satisfies  (4)  with  M = 1 and  ro  - 0 and  also  satisfies 

\Au(t)\„  < (c/t)\u0\„  for  t > 0. 

In  Example  2,  V will  he  // 1 (tR). 

Remark  4.  As  Problem  (1)  for  I < 0 is  not  generally  well  posed,  it  may 
happen  that  the  solutions  starting  from  2 different  points  u0  and  v0 
coincide  after  some  time  f0 . 

Example  4.  £ = £2(0,  1),  A = d/dx,  with  domain  D(.4)  = {ue£, 

du/dx  e £,  u( 0)  = 0};  then  S(t)  = 0 for  t > 1,  and  the  solution  is  explicitly 
given  hy 

10,  if  x < f, 

u(x,  t)  = . . ,f  , 

' |u0(x  - f),  if  t < X < 1. 

Remark  5.  Except  for  the  case  where  D(A)  = £,  the  proof  of  existence  of 
the  solution  of  (1)  as  a fixed  point  of  the  mapping  u u0  - j'0  Au(s)  ds 
does  not  work,  because  this  mapping  transforms  continuous  functions 
with  values  in  D{A)  into  continuous  functions  with  values  in  £ and  thus 
cannot  be  iterated. 

For  the  same  reason  the  classical  explicit  difference  scheme 
[u((n  + 1 )k)  — u(nk)]/k  + Au(nk)  = 0 

does  not  generate  an  approximation  of  the  solution  of  (1)  as  it  is  impossible 
in  general  to  define  u[nk)  for  all  n. 

Actually  the  implicit  difference  scheme 

[u((n  T I )k)  — u(nk)\/k  + Au((n  + l)k)  = 0 (6) 

is  well  defined  for  small  k > Oand  generates  an  approximation  of  the  solution. 


II.  The  Nonlinear  Case:  Approximation 
by  Finite-Dimensional  Problems 

In  this  section  we  will  consider  infinite-dimensional  problems  as  the  limit 
of  ordinary  differential  equations  in  finite  dimensions;  this  is  the  method  of 
Galerkin  (associated  generally  with  Faedo  or  Ritz),  which  we  will  use  in  some 
simple  cases. 
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Let  V be  a separable  reflexive  Banach  space  continuously  imbedded  and 
dense  in  some  Hilbert  space  H identified  with  its  dual.  Thus  we  have 

L c //  cV\  (7) 

Let  A be  a nonlinear  operator,  which  may  depend  on  t,  from  V into  V. 
We  are  interested  in  the  equation 

(du/dt)  + A(u)  = 0,  0 < t < T < + oo,  u(0)  = u0e  H.  (8) 

Step  1.  We  choose  a “basis”  wt, w„, ...  of  V,  that  is, 

for  each  n,  wlt w„  are  linearly  independent; 
the  subspace  spanned  by  all  the  w,  is  dense  in  V. 

As  V is  separable,  such  a basis  exists.  Usually  the  following  steps  can  be 
carried  out  for  each  choice  of  the  basis,  but  in  some  cases  it  is  useful  to 
choose  a special  basis. 


Step  2.  For  fixed  n,  consider  the  ordinary  differential  equation 

n 

find  un  = £ 0,(t)w,  such  that 

i = i 

\^+AMwjj=Q  for  j = 1, . . . , n,  t>0,  (10) 

n 

«U0)  = “On  = Z *iWi- 
i = 1 

As  the  w,  are  independent,  the  matrix  with  entries  (w(,  Wj)  is  invertible 
and  (10)  is  an  ordinary  differential  equation  for  glf  ...,  g„.  To  obtain  local 
existence  on  an  interval  [0,  f„[  with  0 < t„  < T,  we  will  assume  that  A satisfies 

for  each  finite  dimensional  subspace  W of  V,  the 

restriction  Aw  from  W into  W'  defined  by  (11) 

(Aw(u),  v)  = ( A(u ),  v)  for  all  u,  ve  W is  continuous. 


Step  3.  Prove  that  t„  = T for  all  n and  find  a priori  estimates  on  un 
independent  of  n.  Usually  one  can  show  that  if  u0„  is  bounded  in  H (we  will 
suppose  that  u0n-*u0  in  H),  un(t)  remains  in  a bounded  set  of  H;  this 
proves  that  t„=  T and  gives  the  estimate  u„  e bounded  set  of  L°°(0,  T;  H). 
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Step  4.  If  u0n  -»  u0  in  H and  if  we  have  a solution  un  that  remains  in 
some  bounded  set  of  a reflexive  Banach  space  [or  a dual  of  a Banach  space 
such  as  L°°(0,  T ; H)]  then  we  can  extract  a weakly  (or  weakly*)  convergent 
subsequence  to  some  u,  which  we  hope  will  be  a solution  of  (8).  There  are 
two  general  arguments  to  carry  out  step  4:  compactness  and  monotonicity. 
Let  us  begin  with  the  results  about  compactness. 

Compactness  Lemma.  If  B0  <=  Bx  <=  B2  are  three  Banach  spaces  with 
compact  injection  from  B0  into  Bt  and  continuous  injection  from  Bx  into 
B2,  then  if  une  bounded  set  of  Z?°(0,  T;  B0)  and  dujdt  e bounded  set  of 
Z?‘(0,  T ; B2)  with  T < + oo  and  1 < p0 , Pi  < + oo,  then  u„  e compact  set  of 
#“( 0,  T;  Bi). 

Compactness  Argument.  In  this  method  we  try  to  obtain  a sufficient 
number  of  estimates  on  un  and  dujdt  such  that,  using  the  compactness 
lemma,  we  can  extract  from  u„  a strongly  convergent  subsequence  in  some 
Banach  space  (and  weakly  convergent  in  other  spaces)  implying  the  con- 
vergence of  A(un)  to  A(u). 

Let  us  remark  that  in  some  cases  the  choice  of  a special  basis  in  step  1 
appears  to  be  useful  to  obtain  estimates  on  dujdt. 


Example  5 


(d2u/dt2)  — Au  + u3  = 0, 
u(0)  = u0e  Hl(RN)  n L4( RN)  = V, 
(du/dt)( 0)  = uj  e L2(RN)  = H. 


(12) 


This  can  be  considered  as  a system  (u,  t;)  with  v = du/dt,  but  it  is  more  con 
venient  to  work  with  only  the  unknown  u and  to  approach  (12)  by 

n 

un  = Z Mwi  * where  w , is  a basis  of  V, 

i=i  (12') 

(( d2ujdt 2)  - A u„  + uj,  w,)  = 0,  i = 1, . . . , n. 

Taking  in  (12)  the  combination  of  w(  corresponding  to  dujdt,  we  obtain 

dill  du.  2 1 . 1 . \ 

i, (2 1 * e + 2 |grad  "■l??  + 4 |u”fe)  = a <13> 
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This  gives  the  estimates 

un  e bounded  set  of  L°°(0,  T;  V ),  , . 

dujdt  e bounded  set  of  L°°( 0,  T;  H). 

At  this  point  let  us  remark  that  if  we  have  a weakly*  convergent  sequence 
in  L°°(0,  T;  V)\  V„  -*  V,  it  does  not  imply  that  Vn3  converges  to  V3. 

Now  with  the  estimate  on  dujdt  and  using  the  compactness  lemma,  we 
have  u„  e compact  set  of  L2( 0,  T;  LI20C(IR'V)).  [The  injection  from  //*( R*)  into 
L2(R*)  is  not  compact,  but  this  is  true  if  R*  is  replaced  by  a bounded 
open  set  of  Rw.] 

We  can  extract  a subsequence  converging  in  L®( 0,  T;  V)  weakly*  and 
strongly  in  L2( 0,  T;  Lj^R*)),  which  gives  un3  -*•  u3  in  LF(0,  T;  L^R*))  for 
p < + oo,  and  so  u will  be  a solution  of  (12). 

Monotonicity  Argument.  Let  us  assume  that  A maps  U(0,  T ; T)  into 
Lp  (0,  T;  F')with  1 < p < + oo  and  (1/p)  + (1/p')  = l,and  that  A is  monotone 
and  hemicontinuous.  A monotone  means  that 

[ T(A(u)  — A(v),  u — v)dt  >0  for  u,  v e Lp{0,  T,  V),  (15) 

• o 

and  A hemicontinuous  means  that  the  map 

rT 

(A(u  + Xv),  v)  dt  is  continuous  for  X e R,  u,  v e Lp(0,  T;  V). 

(16) 

Under  these  conditions  on  A one  has: 

Monotonicity  Lemma.  If  un  converges  weakly  to  u in  Lp(0,  T;  V ), 
A(un)  converges  weakly  to  /in  Lp(  0,  T;  V'),  and 

tT  rT 

hm  sup  (/l(u„),  u„)  dt  < (/  u)  dt, 

J0  J 0 

then  A(u)  =f 

Let  us  remark  that  A may  have  this  property  without  satisfying  (15)  and 
(16).  If  we  extract  a subsequence  such  that  u„  -» u,  A un  -*f  [assuming  that 
A maps  bounded  sets  of  Lp( 0,  T;  V)  into  bounded  sets  of  If  ( 0,  T;  L)],  and 
uH(T ) -♦  h,  we  see  from  (10)  that 

(du/dt)+f=  0,  u(T)  = h. 
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and 

T M7’)|2+  [ (A(Un),  “n)dt  = i|M„(0)|2  — i|M0  |2. 

•’o 

So 

limsupf  {A{un),  un)dt  <±\u0\2  -$\h\2  = [ (fu)dt, 

Jo  Jo 

and  by  the  monotonicity  lemma  / = A(u)  and  u satisfies  (8). 

Example  6.  If 

{du/dt)  - Am  + u3  = 0,  u(0)  = u0  e 1?{MN),  (17) 

then  V = Hl(UN)  n I?{MN),  H = I?{UN),  and  A{u)  = — Au  + n3  is  monotone 
hemicontinuous  from  W = l}{ 0,  T;  Hl{UN))  n I?{ 0,  T;  I?{MN))  into  its  dual 
W'.  With  a slight  modification  the  monotonicity  argument  works.  The 
estimates  are  obtained  in  this  example  by  taking  in  (10)  the  combination 
of  iVj  corresponding  to  u„.  This  gives 

\ + lsrad  mJ£2  + kll<  = o 

u„  e bounded  set  of  L2(0,  T;  H1(UN)),  L4( 0,  T;  L4(R*)),  and  L°°(0,  T;  L°°(nN)). 

(18) 

Example  7 

{d2u/dt2)  - Au  + {du/dt)3  = 0, 

u(0)  = u0  e Hl{UN),  {du/dt){0)  = u,  e L}{MN).  [ ' 

If  we  consider  the  approximation 

ft 

u„  = £0,(t)w,  and  w,  is  a basis  of  Hl{UN)  n I?{RN), 

<=i  (20) 

(( d2ujdt 2)  - Amb  + ( dujdt )\  w,)  = 0,  j = 1, . . n,  t>  0, 

choosing  the  combination  of  the  w,  corresponding  to  dujdt,  we  have 

u„  e bounded  set  of  L®( 0,  T;  Hl{UN)), 
dujdt  e bounded  set  of  L*(0,  T;  L2{UN))  n L4( 0,  T;  L4(R*)).  ( ' 

Then  if  vH  = dujdt,  we  can  apply  the  monotonicity  lemma  to  v„  and  deduce 
that  for  a subsequence  we  will  have  un->u  weakly,  dujdt  -» du/dt  weakly, 
and  {dujdt)3  -*  {du/dt)3  weakly. 
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Some  examples  may  require  a combination  of  a compactness  argument 
and  a monotonicity  argument;  some  may  be  handled  (with  different 
hypotheses)  by  the  two  methods. 


Example  8 

(du/dt)  - Au  + u3  = 0,  u(0)  = u0  e H1(RN)  n I?(RN)  (22) 


[compare  to  (17)].  Taking  in  (10)  the  combination  of  corresponding  to 
dujdt,  we  have 


dt 


l}  + \i  |grad  U'$  + 1T.  = 


4 dt 


u„  e bounded  set  of  L°°(0,  T;  Hl( R*))  n L°°( 0,  T;  L*(UN)), 
dujdt  e bounded  set  of  L2(0,  T ; L^R*)). 


(23) 


Now  the  estimate  on  dujdt  allows  the  use  of  the  compactness  argument  as 
in  Example  5. 


III.  The  Nonlinear  Case:  Other  Methods 

1.  ACCRETIVE  OPERATORS 

A simple  case  of  (5)  is  where  M = 1 and  (5ii)  reduces  to 

\\(M  + A)~  l\\  < 1/(A  — (o). 

A nonlinear  analog  of  this  inequality  leads  to  the  theory  of  accretive  and 
monotone  operators.  For  these  results  we  refer  to  the  chapter  by  Crandall 
in  this  volume. 


2.  FIXED-POINT  THEOREM  FOR  CONTRACTIONS 

We  want  to  find  the  solution  u of  (8)  by  means  of  the  iterative  method 

(dU,tltft)  + B(U„t)  = B(U,)-A(U,),  U..t( 0)=Ko,  (24) 

with  a suitable  B such  that  un-^*un+l  defines  a map  S that  is  a strict  con- 
traction in  some  Banach  space  so  its  unique  fixed  point  will  be  the  desired 
solution. 
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Example  9 

du 

Jt 


— An  + 


<p(x,  u, 


du 

dx 


= 0,  u(0)  = uoeHl(UN),  (25) 


dtp  , 

to  (X’  V) 


<K, 


dip 

dv 


(x,  u,  v) 


where  tp  satisfies 

<p(x,  0,  0)  e L2(Rn), 

Using  B(u)  = — Am  and  the  estimate 

(dw/dt)  - Aw  =/  implies  ||w(f)||J,.  < c [ |/(s)|£2  dx, 

J fl 


<K.  (26) 


(27) 


w(0)  = 0, 

we  have  the  existence  (and  uniqueness)  of  u satisfying 

u e L^O,  T;  Hl(UN)),  du/dt  e L°°(0,  T\  L2(UN)).  (28) 


3.  FIXED-POINT  THEOREM  FOR  COMPACT  MAPPINGS 

We  use  the  same  map  S defined  by  (24)  but  we  now  require  that  S is 
continuous  and  maps  some  compact  convex  set  of  a Banach  space  into  itself 
in  order  to  apply  the  fixed-point  theorem  of  Schauder. 


Example  10 

du 


dt 


— Am  + 


«>(*, ) = 


0 in  fl  a bounded  open  set  in  R*, 


u(0)  = u0  e H0l(Cl), 


(29) 


and  tp  satisfies 

<p(x,  m,  v)  continuous  in  u,  v,  measurable  in  x, 

\<p(x,  u,  r)|  < a(x)  + 6|uj  + c|u|  with  fl(x)eL2(fl),  b,  c > 0. 

We  use  the  map  defined  by  B = - A and  we  use  the  following  estimate: 
if  (dw/dt)  - Aw  =/e  L2( 0,  T;  L2(Q)),  we  have  w(0)  = w0  e H0l(Q), 


(>)  MOIlfco'  ^ M°)fl&0'  + ^ jj/(s)|£?  ds, 

(ii)  [ (\dw/dt\h  + |Aw|£:)  dt  < c[|w(0)||?,o.  + f \f(s)\2  ds]. 
J0  *0 


176 


LUC  TARTAR 


Estimate  (i)  shows  that  we  can  find  a convex  set  of  L2( 0,  T ; H0l(Q))  that  is 
mapped  into  itself.  Estimate  (ii)  shows  that,  using  the  compactness  lemma, 
this  map  is  compact  and  we  can  obtain  a fixed  point. 


4.  FIXED-POINT  THEOREM  FOR  INCREASING  MAPPINGS 

Using  some  map  S,  such  as  in  (24),  we  now  require  that  S is  increasing 
in  some  ordered  Banach  space  such  that  we  can  apply  some  special  fixed- 
point  theorem  for  increasing  mappings. 

In  most  cases  we  apply  some  fixed-point  theorem  used  in  Section  III.2 
or  III.3.  The  order  argument  is  used  to  find  a suitable  convex  set  as  well 
as  a suitable  operator  B. 


Example  11 

( du/dt ) - Am  4-  (p(x , u ) = 0,  u(0)  = u0  e l}{UN), 

(p  continuous  in  u,  measurable  in  x. 

We  assume  that  we  can  find  two  functions  a,  b satisfying 

a,  be  I}( 0,  T;  L2[UN )) 

dci  db 

~ — An  + tp{x,  a)  < 0 < - — A b + (p(a,  b)  if  a(x,  0 ) <u0  < b(x,  0), 
at  at 


(32) 


<p(x,  u)  e L2( 0,  T ; I}(UN))  if  a < u < b, 
dtp. 


(33) 


dx 


(x,  u)  < K if  a(x)  < u < b(x). 


Then 


(34) 


(35) 


there  is  a solution  of  (32)  satisfying  a <u  <b,  and 
{du/dt)  - A ue  I}{ 0,  T;  L2{UN)). 

For  this  we  consider  v = Su  defined  by 

(dv/dt)  — Av  4-  Kv  = Ku  - tp{x,  u),  y(0)  = uQ , 

and  remark  that  K has  been  chosen  so  that  Ku  — <p(x,  u)  is  increasing  on 
the  interval  [a,  b]. 

Now  the  crucial  point  is  the  maximum  principle  for  the  operator  — A + K : 

if  {dw/dt)  — Aw  + Kw  > 0,  w(0)  > 0,  then  w > 0.  (36) 

The  hypotheses  on  a , b imply  that  S is  increasing  and  maps  the  interval 
[a,  b]  into  itself,  and  so  S has  a fixed  point. 
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Example  12 


du  du  , , 

+ +U*-V2=0, 


dv  dv 
dt  dx 


dt  dx 

u(0)  = u0  e L®(R),  v{0)  = v0e  L*(R), 

For  K > 2 M we  consider  the  iterative  process 


dun+l  dun+ 1 

+ — — + Ku  1 = Kun 


dt 


dx 


+ v2  — u2  = 0, 

0 < u0 , v0  < M. 


(W)2  + (if)2. 


(37) 


dif+l 
~ dt~ 


dif+l 


+ Kif+1 


= Ktf- 


(rf)2  + (if)2, 


(38) 


i/n+ 1 (o)  = uq  , r+1(o)  = v0. 

Note  that  K has  been  chosen  so  that  Ku  - u2  + v2  and  Kv  - v2  + u2  are 
increasing  for  0 < u,  v < M. 

The  crucial  point  is  that 


if 


dw  dw 

-T-  + a — 

dt  dx 


+ Kw  > 0 


w(0)  > 0, 


then  w > 0; 


(39) 


then  (38)  defines  an  increasing  mapping  (un,  if)->(un+l,  r"+1),  which  is 
Lipschitz  continuous  and  maps  0 < u,  v < M into  itself.  So  (37)  has  a unique 
solution  such  that  0 < u,  v < M.  This  approach  proves  that  the  solution  u,  v 
increases  if  u0,  v0  increases.  Using  this,  one  can  also  show  that 
||u(f)||i«  + ||r(t)||L«  decreases. 


5.  OTHER  METHODS 

Of  course  there  are  many  other  methods.  One  always  has  to  improve 
some  classical  methods  in  order  to  fit  a particular  problem. 

The  examples  given  here  are  not  given  in  the  most  general  form  and  we 
have  restricted  ourselves  to  questions  of  existence.  Problems  of  regularity, 
periodic  solutions,  and  asymptotic  behavior  of  the  solutions  can  be  solved 
for  these  examples. 

For  more  details  and  examples  let  us  give  a single  reference: 

J.  L.  Lions,  “Quelques  Methodes  de  Resolution  des  Problemes  aux  Limites  non  Lineaires.” 
Dunod  Gauthier  Villars,  Paris,  1969. 
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I had  the  good  fortune  of  knowing  Professor  Lefschetz  for  a number  of 
years.  As  with  so  many  other  young  people,  he  had  a profound  influence 
on  my  outlook  toward  life  and  the  direction  of  my  research.  Although  I had 
an  interest  in  functional  differential  equations  before  going  to  RIAS  in  1958, 
it  was  during  a conversation  with  Lefschetz  that  he  told  me,  “You  should 
read  the  Russian  book  on  stability  by  Krasovskii.”  Naturally,  he  was  right 
and  my  serious  research  on  this  subject  began  at  this  point.  The  book  of 
Krasovskii  has  had  a tremendous  impact  on  later  developments.  And,  once 
more,  the  deep  perception  of  Lefschetz  showed  through  even  in  areas  not 
of  direct  interest  to  his  research.  It  is  indeed  an  honor  to  dedicate  this 
chapter  to  Professor  Lefschetz. 


1.  Introduction 

The  number  of  stimulating  chapters  on  functional  differential  and  integral 
equations  presented  in  this  volume  is  a clear  indication  of  the  rapid  develop- 
ment of  the  theory  over  the  past  few  years.  The  literature  has  become  so 
extensive  that  it  is  now  impossible  in  a few  pages  to  give  a clear  picture 
of  the  spectrum  of  current  research.  Therefore,  I will  confine  my  attention 
to  a special  class  of  neutral  equations  (which  will  include  retarded  ones 
with  finite  delay)  and  to  those  results  for  this  class  that  are  of  particular 
interest  to  me. 


• This  research  was  supported  by  the  National  Science  Foundation  under  GP-28931X2 
and  in  part  by  the  U.S.  Army  Research  Office,  Durham,  under  DA-ARO-D-31-124-73-G-130. 
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In  general  terms,  a neutral  functional  differential  equation  (NFDE)  is  a ^ 

differential  relation  in  which  the  derivative  of  the  unknown  function  may 

depend  on  past  values  of  the  function  as  well  as  its  derivative.  In  contrast  4 

to  retarded  equations,  even  the  basic  problem  of  existence,  uniqueness,  and  • 

continuous  dependence  does  not  have  a straightforward  solution.  The  ] 

manner  in  which  the  norm  on  the  initial  data  depends  on  the  derivative 
plays  a significant  role.  This  was  pointed  out  in  the  fundamental  paper  [8], 
which  contains  many  interesting  examples  as  well  as  a solution  of  the  above 

problems  for  equations  linear  in  the  derivatives  and  initial  functions  < 

absolutely  continuous.  Extensions  were  given  in  [30,  31].  A glance  through 
Mathematical  Reviews  will  show  that  there  is  a tremendous  amount  of 

work  being  done,  especially  in  the  USSR,  on  neutral  equations  with  initial  | 

data  that  have  derivatives  whose  pth  power  is  integrable.  However,  a ] 

qualitative  theory  has  not  yet  been  developed. 

In  [21]  in  1967,  a particular  class  of  neutral  equations  was  introduced  ; 

for  which  initial  data  could  be  specified  in  the  space  of  continuous  functions.  j 

In  subsequent  years,  this  class  was  further  restricted  in  such  a way  that  a 

qualitative  theory  now  seems  possible.  It  is  this  class  that  will  concern  us  \ 

in  this  chapter.  With  the  knowledge  we  now  have,  it  seems  possible  that  a 
similar  theory  could  be  developed  for  initial  data  in  other  spaces. 


2.  Notation 

Suppose  r > 0 is  a given  real  number,  £"  an  n-dimensional  linear  vector 
space  with  norm  | |,  and  C = C([  — r,  0],  £")  the  space  of  continuous 
functions  mapping  the  interval  [ — r,  0]  into  £",  with  |<£|  = sup_r50sO  j<£(0)| 
for  (p  e C.  Let  30  be  the  space  of  bounded  linear  operators  from  C to  £" 
with  ||B||  = inf  {k  : \B(f)\  < k\(f>\,  (f>  e C}  for  any  B e 30.  For  any  B e 30,  there 
is  an  n x n matrix  function  p on  ( — oo,  oo)  of  bounded  variation  such  that 
p(0)  = p(-r),  d < 0,  p(0)  = p{0),  6 > 0,  and 

B0  = f°  [dp(0M0) 

J -r 

for  all  </)  e C.  Furthermore,  there  is  a continuous,  nonnegative  scalar  function 
y on  [0,  oo ) such  that  y(0)  = 0, 

f°[rf/1(#)]0(0)-MO)-MO-)WO) 


< y(s)  sup  \<f>(s)\ 

-s<0^ 0 
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for  s e [0,  r],  </>  e C.  We  say  B is  atomic  at  0 if  det[/i(0)  — /i(0_)]  / 0 and 
B is  nonatomic  at  0 if  /v(0)  — /i(0_ ) = 0.  Notice  that  to  say  B is  atomic  is  not 
to  negate  the  statement  that  B is  nonatomic  except  for  n = 1.  In  the  same 
way,  one  can  define  atomic  and  nonatomic  at  any  point  9 e [ — r,  0]. 

Let  fi  be  the  closure  of  an  open  set  Q of  C,  k > 0 an  integer, 

,?k  = {/:□-»  En,fu\j  = 0,  l,....  It,  continuous  and  bounded  in  H}, 

where  fU)  is  the  7th  Frechet  derivative  of  f.  For  any  / e 3Fk,  we  define  the 
norm  of/by  j|/||  = sup*en{£}=o  ||/J'(</>)||},  where  ||/°(0)||  = \f(<j>)\  and, 
for  j > 1,  the  double  bars  represent  the  norms  for  multilinear  operators. 

If  A > 0 and  x:  [o  — r,  a + A]  ->  En,  then  for  any  te[o,  t r + A]  we  let 
xt:  [ — r,  0]  -»  E"  be  defined  by  x,(9)  = x(t  + 9),  — r<9<  0. 

For  any  (D,  x ,Fk  with  D atomic  at  zero,  we  define  a neutral 

Junctional  differential  equation  by  a relation 

-dtD(x,)=f{*  r)  (2-1) 


and  designate  it  by  NFDE  (£>,/).  For  any  4>  e ft,  a solution  x(o,  0)  through 
(ct,  </»)  e R x C is  a continuous  function  x:  [rx  — r,  a + A) ->  E"  for  some 
A > 0 such  that  x,($)e  Q,  t e [<r,  a 4-  A ),  x„  = $,  and  Dx,  is  continuously 
differentiable  on  [ct,  a + A)  and  satisfies  the  equation  on  [<x,  a + A).  If  a - 0, 
we  designate  the  solution  by  x(</>). 

By  far  the  most  interesting  equations  have  the  form 


d 

dt 


x(t)  - £ Bkx(t  - rk) 


k=  l 


= /(*/)< 


(2.2) 


where  each  Bk  is  an  n x n matrix  and  each  rk  > 0. 


3.  Existence,  Uniqueness,  Continuous  Dependence 

If  (D,  x &°,  then  for  any  there  exists  a solution  of  the 

NFDE  (£>,/)  through  (0,  </>)  on  a maximal  interval  [0,  A^,)  such  that  either 
A0  = + oc  or  lim,^^  |x, | = + oo  or  lim,_,44  x,  = ip  exists  and  i jt  e 9C1.  If,  in 
addition,  (D,/)e.#  x then  the  solution  x(</>,  D,/)(f)  through  (0,  </>) 
depends  continuously  on  (<£,  D,/,  ()efix  J x,f  x [-r,  A+)  (see  [4,  10, 
30,  31]). 

A function  x:  ( — r — A,  0]  -» £",  A > 0,  is  said  to  be  a backward  con- 
tinuation through  (0,  0)  for  the  NFDE  (D, /)  if  x0  = <p  and  Dx,  is  con- 
tinuously differentiable  on  ( — A,  0]  and  satisfies  the  equation  on  ( — A,  0]. 
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If  D is  atomic  at  -r,  (D,f)eJS  x .F0,  <f>  e Q,  then  there  is  a backward 
continuation  through  (0,  (f>)  and  the  solution  x(</>,  D,  f)(t)  depends  con- 
tinuously on  (</>,  D,f  t)  e Q x x ;?1  x (-r  - A#,  0],  where  A#  is  defined 
by  the  maximal  interval  of  existence  (see  [10]). 

Define  the  solution  operator  of  an  NFDE  (D,f)  by 

td,  f(t):  q - n,  rDi  f(t)<f,  = x,(</>,  dj), 

for  t e [0,  A 0).  From  the  above  remarks,  if  D is  atomic  at  — r,  then  TD  f(t) 
is  a homeomorphism.  Since  the  set  of  De  J for  which  D is  atomic  at  — r 
is  an  open  dense  set  in  .*#,  it  follows  that  the  set  of  NFDE  (D,f)  for  which 
Tp  f(t)  is  a homeomorphism  is  open  and  dense  in  x (see  [10]). 
Consequently,  any  retarded  functional  differential  equation 

*(0  =/(*»)  (31) 

can  be  approximated  in  .<A  x 3FX  by  an  NFDE  for  which  TD  f(t)  is  a 
homeomorphism;  in  particular,  by 

^[x(r)- ex(f  - r)]=/(xf).  (3.2) 

It  would  be  interesting  to  know  how  the  solution  operator  TD  f(t)  of  (3.2) 
approaches  the  solution  operator  TD  f(t)  of  (3.1)  as  e -»0. 

The  above  results  on  continuous  dependence  are  not  sufficiently  general 
to  handle  many  practical  problems.  More  specifically,  if 

D(f)  = </>(0)  - ^ Bk(p(  — rk)  = (f)(0)  - g(4 >),  (3.3) 

*=  i 

where  each  Bk  is  an  n x n matrix  and  rk  > 0,  the  operator  D e is  not 
continuous  in  the  parameters  rk,  k = 1,  2,  ...,  N.  On  the  other  hand,  it  is 
shown  in  [8,  30]  that  the  solution  is  continuous  in  the  rk  provided  each 
rk  > 0.  Furthermore,  this  continuity  property  even  holds  as  each  rk  ->  0 
provided  that  ||g||  < 1.  For  n = 1,  this  latter  condition  is  equivalent  to 
BLi  |Bk|  < 1.  In  [32],  it  is  shown  that  for  n = 1 the  inequality  £k=  t |Bk|  > 1 
implies  that  in  any  neighborhood  of  0 e Rn  there  are  positive  (r,,  . . , rN)  and 
a root  A of 

1 - £flkexp(-Ark)  = 0 (3.4) 

k=  l 

with  positive  real  part.  Using  this  fact,  one  can  show  that  for  n = 1 the 
condition  , |Bk|  < 1 is  necessary  to  have  continuous  dependence  of  the 
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solutions  of  the  NFDE  (D,/)  as  the  rk  -»0.  Analogous  results  for  the  matrix 
case  seem  to  be  difficult  to  obtain. 

Throughout  the  remainder  of  this  paper,  we  assume  the  solution  operator 
Td  f(t):  Q -»  Q is  defined  for  all  t > 0. 


4.  Stable  /^-Operators 

A few  years  ago  (see  [5]).  a class  of  neutral  functional  differential 
equations  was  defined  for  which  a rather  general  theory  has  evolved.  It  is  the 
purpose  of  this  section  to  define  and  give  some  properties  of  this  class. 

For  D e D atomic  at  0,  let  C D = (0  e C : £)0  = 0}.  On  Ce,  the  func- 
tional equation 


Dy,  = 0,  r>0,  yo  = 0eCD, 

generates  a strongly  continuous  semigroup  of  linear  transformations  TD(t): 
CD~*  CD,  t > 0.  defined  by  To(r)0  = y,{ if/).  Let  aD  be  the  order  of  the  semi- 
group TJt),  t > 0;  that  is, 

aD  = inf  {a  e R : there  is  a K = K(a ) such  that  TD(t)  \ < Keat,  t > 0}. 


Definition  [5].  D is  stable  if  aD  < 0. 

The  simplest  operator  D that  is  stable  is  D<p  = 0(0).  This  operator 
corresponds  to  retarded  functional  differential  equations. 


Theorem  4.1  [5].  The  operator  D is  stable  if  and  only  if  there  is  an 

a > 0,  b > 0,  such  that  for  any  n 6 R,  h e C([a , x ),  £"),  any  solution  y of 

Dy,  = h(t ),  f><7,  (4.1) 

must  satisfy 

|y,|  < be~ai,~a*\ya\  + b sup  |Jt(u)|-  (4.2) 

In  particular,  if  y is  a bounded  solution  of  (4.1)  on  ( — x,  0],  then 
|>’,|<b  sup  |h(u)|, 

- X <U<,t 


- x < I < 0. 


(4.3) 
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Theorem  4.2  [23].  Let  D e .'A,  D0  = J",  dn(0)<f>(0),  and  suppose  n has 
no  singular  part.  Then  D is  stable  if  and  only  if  there  is  a S > 0 such  that 
Re  / < — <5  for  all  /.  satisfying  the  characteristic  equation 

det  D(e'  I)  = 0.  (4.4) 

In  particular,  if  D0  = 0(0)  — £*=1  Bk4>(  — rk),  0 < rk  < r,  then  D is  stable  if 
and  only  if  there  is  a S > 0 such  that  Re  k < — 6 for  all  /.  satisfying 

det  (/  - £ Bke~'r^  = 0.  (4.5) 

If  the  rjrk  in  (4.5)  are  rational,  the  proof  of  Theorem  4.2  is  trivial.  The 
complications  arise  when  some  of  these  ratios  are  irrational  and  the  proof 
is  very  recent  [6,  23].  A less  complete  discussion  of  the  asymptotic  behavior 
of  solutions  has  been  given  by  many  authors.  In  fact,  in  [1]  it  was  proved 
that  individual  solutions  of  difference  equations  approach  zero  exponentially 
as  t -*  x under  the  above  hypotheses.  In  [21],  it  was  shown  that  the 
hypotheses  imply  there  are  a > 0,  K > 0,  such  that 

\TM\  <Ke~*'  |0|l5  t>  0, 

for  all  0 e C'([  — r,  0],  E")  n CD  and  |0|j  is  the  C1  norm  on  0. 

The  following  pages  will  be  concerned  with  NFDE  (D.J  ) with  D stable. 
A few  years  after  stable  D-operators  were  discussed  in  some  detail,  Melvin 
considered  the  following  problem:  Suppose  D is  the  difference  operator 
given  by  (3.3)  and  D is  stable.  Do  there  exist  intervals  (with  interior 
points)  Ik  containing  rk,  k = 1 N,  such  that 

Z50  = 0(0)  - £ Bk4>(  — rk)  (4.6) 

*=  i 

is  stable  for  each  rke  I kl  If  such  a situation  holds,  we  say  D is  stable 
locally  in  the  delays  (sld). 

To  be  sld  is  a necessary  requirement  in  a physical  problem  and  a com- 
putable characterization  of  this  property  is  very  important.  Some  results  have 
been  obtained.  For  w = 1,  it  was  shown  in  [32]  that  D is  sld  if  and  only  if 
Y!k=i  I®* I < 1-  This  result  is  also  given  implicitly  in  [23]. 

The  matrix  case  seems  to  be  extremely  difficult.  The  reason  for  this  is 
that  the  characteristic  equation  (4.5)  is  of  the  form 


(4.7) 
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where  all  the  njk  are  nonnegative  integers.  We  say  that  (4.7)  is  stable 
globally  in  the  delays  (sgd)  if  for  any  nonnegative  numbers  rk,k  = 1,  2, N, 
there  is  a S > 0 such  that  the  solutions  X of  (4.7)  satisfy  Re  / < -S.  It  is 
then  shown  in  [11]  that  sld  is  equivalent  to  sgd.  This  result  is  far  from  the 
characterization  that  one  would  like,  but  it  is  useful  to  obtain  the  regions 
of  stability  in  the  parameters  tij  in  specific  examples. 


5.  w-Limit  Sets 

In  this  section  we  summarize  some  of  the  implications  for  NFDE  (D,  f) 
of  the  hypothesis  that  D is  stable.  For  a solution  x(<p)  of  an  NFDE  (£),  /), 
let  y + (0)  = (J,20  *,(0)  be  the  positive  orbit  through  (p  and  let  to((p),  ot((p) 
be  the  to-  and  a-limit  sets,  respectively,  of  the  orbit  through  tp.  A set  U in  C 
is  invariant  for  the  NFDE  (D,  f)  if  for  any  (pel),  there  is  a function 
x:  ( — oo,  oo )-*  En  such  that  x0  = (p,  x,  e U,  t e ( — oo,  oo),  and  x satisfies  the 
equation  (£>,/)  for  all  t e ( — oo,  oo). 

Theorem  5.1  [5].  If  D is  stable,  then  y+ {(p)  is  relatively  compact  if  and 

only  if  y + ((p ) is  bounded.  If  y + (</>)  is  bounded,  then  to(tp)  is  a nonempty 
compact  connected  invariant  set. 

To  understand  the  manner  in  which  stable  D-operators  simplify  problems, 
we  give  a proof  of  the  first  part  of  this  theorem.  Obviously  y + (</>)  relatively 
compact  implies  y + (</>)  bounded.  Conversely,  suppose  y + ((p)  is  bounded. 
There  exists  a constant  M such  that  |/(y  + (</>))  | < M.  Since  for  any  x > 0, 
t >0,  x,  satisfies 

£(*r +<-*«)=(  f(xs)ds, 

* t 

it  follows  from  Theorem  4.1  that 

|xf+t  - x, | < b |xt  - (p | + bxM. 

Consequently,  x((p)(t ) is  bounded  and  uniformly  continuous  on  [-r,  oo). 
Thus,  y + (tp)  is  relatively  compact  and  the  result  is  proved. 

The  author  conjectured  some  time  ago  that  D stable  implies  the  backward 
extension  on  cu-limit  sets  of  bounded  orbits  are  unique.  For  analytic  systems 
this  conjecture  is  true.  In  fact,  the  following  result  is  known. 
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Theorem  5.2  [33].  Suppose  x is  a solution  on  ( — oo,  0]  of  the 
retarded  functional  differential  equation  (£>,/),  D<f>  = (f>(0),  f analytic.  Then 
x is  analytic  in  a region  containing  ( — oo,  0]. 

With  D any  stable  operator,  the  proof  in  [33]  can  be  modified  to  show 
that  the  same  result  is  true  for  any  NFDE  ( D,f ).  Consequently,  there  cannot 
be  two  backward  extensions  on  co-limit  sets  of  bounded  orbits. 

Mallet-Paret  has  communicated  an  example  to  the  author  that  shows  that 
the  above  conjecture  is  false  for  general  NFDE  (£>,/)  with  /in  C°°.  Con- 
sequently, it  is  desirable  to  prove  that  this  result  is  true  generically. 

For  a retarded  functional  differential  equation 

*(0  =/(*«)> 

any  bounded  solution  on  ( — oo,  oo)  obviously  must  be  C*+1  if  / is  C*. 
Surprisingly,  this  same  result  is  true  for  NFDE  (D,f)  with  D stable.  A less 
general  fact  for  k = 0 is  in  [12]  and  the  general  case  is  treated  in  the 
thesis  [27],  For  later  reference,  this  is  stated  as 


Theorem  5.3  [27].  For  an  NFDE  (£>,/  )eJx  &k,  D stable,  any  solu- 
tion bounded  on  ( — oo,  oo)  must  be  C*+1.  Even  more,  the  co-limit  set  of 
a bounded  orbit  consists  of  Ck+ 1 functions  such  that  for  each  j < k + 1,  the 
family  consisting  of  the  y'th  derivative  of  each  element  of  the  co-limit  set  is 
uniformly  bounded  and  equicontinuous. 

In  particular,  any  periodic  solution  of  an  NFDE  (D,f)e  M x &k  must 
be  in  C*+1.  Theorem  5.3  exhibits  a very  important  property  of  an 
NFDE  ( D,f ) with  a stable  D-operator.  Bounded  orbits  tend  to  “smooth” 
at  oo. 

The  basic  ideas  of  the  proof  of  Theorem  5.3  are  as  follows.  We  need 
the  following  lemma. 


Lemma  5.1.  Suppose  D is  stable,  h:  ( — oo,  a]  -*  £"  and  h are  uniformly 
continuous  and  bounded.  If  x(f)  is  a bounded  solution  of 

Dxt  = h(t) 

on  ( — oo,  a],  then  x(f)  exists  and  is  uniformly  continuous  and  bounded 
on  (-oo,  a]. 
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Proof.  Since  D(xf+r  - x,)  = h{t  + x)  - h(t).  Theorem  4.1  implies 
|xf+t-x,|<h  sup  \h(u  + t)  - h(u)\ 

u e ( — oo,  a] 

for  t e ( - oo,  a],  t + t e ( - oo,  a].  Thus  x is  uniformly  continuous  on  ( - oo,  a]. 
Also,  using  the  same  argument. 


*,+T  - *f 

_ - xt 

< b sup 

u e ( — oo,  a] 

h(u  + t)  — h(u)  h(u  + s)  — h(u) 

t 

s 

V3 

< b sup  I [/i(«  4-  in)  — h(u  + t>s)]  dv  . 

u 6 ( — oo,  a]  1^0 

Since  h is  uniformly  continuous,  [x(f  + r)  - x{t)]/x  approaches  a continuous 
limit  x(t)  as  t -+  0.  Obviously,  D{xt)  = h{t)  and,  as  before,  x{t)  is  bounded 
and  uniformly  continuous  on  (— oo,  a]. 

With  this  lemma,  we  prove  Theorem  5.3  for  k = 0.  If  h(t)  = Dxt,  then  h(t) 
is  bounded  and  continuous  on  (—  oo,  a].  Also,  dDxJdt  = h{t)  = f(x.)  is 
bounded  and  continuous  on  (—  oo,  a].  Thus,  h is  uniformly  continuous  and 
D stable  implies  x uniformly  continuous  on  ( - oo,  a],  and  so  {xf , t e ( - oo,  a]} 
is  precompact.  Thus,  h(t)  is  uniformly  continuous  on  ( — oo,  a].  Lemma  5.1 
implies  x is  C1  and  x uniformly  continuous  and  bounded  on  ( — oo,  a ].  The 
remainder  of  the  proof  of  the  theorem  follows  from  similar  reasoning. 


6.  Representation  of  Solution  Operator 


In  a retarded  functional  differential  equation,  x(t)  =f(x,),fe  ^ k , k > 0, 
the  solution  operator  T{t)  satisfies  the  following  property:  For  any  co  > r 
and  any  bounded  set  A c C for  which  {T(t)A,  0 < f < a>}  is  bounded,  the 
set  T(a>)A  is  precompact.  It  is  a fundamental  result  that  the  solution  operator 
of  an  NFDE  is  the  sum  of  a bounded  linear  operator  and  an  operator  of 
the  above  type. 

To  state  this  result  precisely,  some  notation  is  needed.  If  D is  atomic  at 
zero,  there  is  a matrix  O = (<f>1, . . . , <f>„)  with  each  <f>je  C such  that  D(<I>)  = I. 
Let  CD  = {0  e C:  D<f>  = 0},  'F  = / - <DD(  ),  and  let  TD(t)  be  the  semigroup 
on  C generated  by  the  equation  D(y,)  = Z>0,  y0  = 0-  Then  TD(t)\CD  is  also 
a semigroup  on  CD  and  4L  C -*CD. 
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Theorem  6.1  [13].  The  solution  operator  TD  f(t)  of  the  NFDE  (D,f) 

can  be  written  as 

TD'f(t)=TD(t)'V  + U(t),  f>0,  (6.1) 

where  U(t):  C -»  C satisfies  the  following  property:  For  any  « > 0 and  any 
bounded  set  A a C for  which  { T(t)A , 0 < t < w)  is  bounded,  the  set  V(a>)A 
is  precompact.  If,  in  addition,  D is  stable,  then  there  is  an  a > 0 such  that 
the  spectral  radius  p(TD(f)T)  < exp  (-at),  t > 0. 

We  indicate  the  proof  of  the  representation  formula  (6.1)  in  Theorem  6.1. 
There  exist  positive  constants  K > 0,  a > 0,  such  that  ||TD(f)4/||  < Ke  at , 
f > 0.  Define  U{t)  = TD  f(t)  - TD(t)'¥.  Then 

D(U(t)<f>)  = D{Td  f{t)<!)  — TD(tyV(f)) 

= D(TDtfm-D(TD(t)^) 

= D(f)  + [' f(TD  f(s)(f>)  ds  — D(T0) 

Jo 

= D<j>-h(f(TD'f(s)<t>)ds. 

Jo 


Therefore, 


t + r 


D(U(t  + t )<t>  - U(t)<t>)  = f /(TDt/(s)0)  ds. 

J t 


Theorem  4.1  implies 


\U(t  + t)0  - L/(r)</> | < 6|C/(t)0  - \ + b sup 

V 


fnjwds . 

J u 


Since  U{0)(f>  = Q>D<f>  lies  in  a finite-dimensional  space,  it  is  now  clear  that 
U satisfies  the  properties  stated  in  the  theorem.  This  proves  (6. 1 ). 

Suppose  Td  f(t)  has  the  following  property: 

(B)  For  each  f,  the  set  (Td  /(t),  0 < t < t)  is  bounded.  Then  Theorem 
6.1  asserts  that  TD  /(t)  is  the  sum  of  a bounded  linear  operator  T0(t)4' 
and  a completely  continuous  operator.  Furthermore,  if  D is  stable,  then  there 
is  an  a > 0 and  an  equivalent  norm  in  C so  that  |TD(f)4/|  < e~a\  t > 0. 

Consequently,  for  each  t > 0,  the  solution  operator  is  the  sum  of  a con- 
traction operator  and  a completely  continuous  operator.  This  is  stated  as 
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Corollary  6.1.  If  TD  f(t)  has  property  (B)  and  D is  stable,  then  U(t)  in 
(6.1)  is  completely  continuous  and  there  is  an  a > 0 and  an  equivalent  norm 
in  C such  that  | Ti,(f  J'F  | < exp  ( — at). 

For  D stable  and  TD  f(t)  satisfying  property  (B),  the  map  TD  f(t),  t > 0, 
is  an  a-contraction  in  the  sense  of  Kuratowski  [26],  with  a(TD j(t))  < 
exp  (-at).*  We  can  therefore  bring  to  bear  on  these  problems  all  of  the 
known  results  on  a-contractions  [36].  The  most  important  applications  so 
far  have  been  in  the  understanding  of  the  spectral  theory  for  linear  systems 
and  the  existence  of  periodic  solutions  of  periodic  NFDEs.  Some  aspects  of 
this  subject  will  be  mentioned  in  later  sections.  At  the  present  time,  we 
remark  only  that  the  functions  (D,  /)  can  depend  on  the  independent 
variable  t and  the  previous  theory  remains  valid. 


7.  Linear  Equations 

In  this  section  we  consider  linear  autonomous  NFDE  ( D , L)  e 06  x M 
and  perturbations  of  such  systems.  The  solution  operator  TD  L(t),  t > 0,  is  a 
strongly  continuous  semigroup  of  linear  operators  with  infinitesimal 
generator  A given  by  A<f>(0)  = <£(0),  -r  <0  < 0,  and  D(A)  = {<f>  e C : <^>  e C, 
D<j>  = L(J)}.  The  spectrum  <r(A)  of  A contains  only  the  point  spectrum  Pa(A) 
and  A e a(A)  if  and  only  if  A satisfies  the  characteristic  equation 

det  [D(exl)  — L(ex  I)]  = 0.  (7.1) 

For  a proof  of  these  facts  see  [21]. 

if  Q is  a bounded  linear  operator  on  C,  then  a complex  number  //  is 
called  a normal  eigenvalue  of  Q if  n e Pcr(Q)  and  there  is  an  integer  k such 
that  C = 91(Q  — nl)k  © yt(Q  — nlf  and  9l(Q  — /*/)*  has  finite  dimension.  A 
point  n is  a normal  point  of  Q if  it  is  either  a normal  eigenvalue  or  a 
point  in  the  resolvent  set  of  Q.  We  need  the  following  result  from  [9]: 

Lemma  7.1.  Suppose  P,  Q are  bounded  linear  operators  on  C with  Q 
compact.  If  U is  an  unbounded  connected  component  of  normal  points  of  P, 
then  U consists  only  of  normal  points  of  P + Q. 


* For  any  bounded  set  Bin  a Banach  space  X,<x(B)  = inf{d:  B has  a finite  cover  of  diam  <d}. 
If  T:  S a X -*  X.  then  a(T)  = inf  {It : tx(TB)  < k<x(B)  for  all  bounded  sets  B c Y}. 
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From  Lemma  7.1,  Theorem  6.1,  and  the  fact  that  P<r(TD  L(t))\{ 0}  = 
exp  <t (A)t,  we  have  the  following  important  result,  which  is  a special  case 
of  [23]. 

Theorem  7.1.  For  any  a > aD  and  any  k satisfying  (7.1)  with  Re  k > a, 
it  follows  that  //(f)  = eh  is  a normal  eigenvalue  of  TD  L(t). 

In  particular,  if  D is  stable,  then  aD  < 0,  the  set  A = {A  : Re  k > 0,  k e <r(A)} 
is  finite,  and  there  is  a natural  decomposition  of  C as  C = P © Q,  where 
P,  Q are  subspaces  invariant  under  TD  L(t)  with  the  spectrum  of  TD  L(t) 
restricted  to  P being  exp  At  and  P is  finite  dimensional. 

This  basic  result  allows  one  to  begin  the  discussion  of  the  behavior  of 
solutions  in  a neighborhood  of  an  equilibrium  point  for  nonlinear  equations; 
for  example,  the  saddle-point  property  [4a],  stability  in  critical  cases  [14,  22], 
bifurcation  [2,  15].  A recent  treatment  of  some  of  these  latter  questions  for 
retarded  equations  in  the  space  Wxl  rather  than  C has  been  given  in  [35]. 
With  this  latter  space,  one  can  discuss  bifurcation  caused  by  the  delays. 


8.  Linear  Periodic  Systems 

In  this  section  we  consider  periodic  linear  NFDE  (D,  L);  that  is,  D(t,  0), 
L(t,  0)  are  both  linear  in  0 and  co-periodic  in  t.  The  solution  operator 
Td  L(t,  r)  is  defined  for  all  t > x and  is  a bounded  linear  operator  on  C. 
The  following  definition  is  a modification  of  the  one  in  [19]. 

Definition  8.1.  A complex  number  p is  a characteristic  multiplier  of 
NFDE  (D,  L)  if  p is  a normal  point  of  TD  L(x  + co,  t). 

As  in  [16, 37],  one  shows  that  the  characteristic  multipliers  are  independent 
of  t.  There  is  a decomposition  of  C as  C = P © Q,  P = 91(T(t  + co,  t)  — p/)*, 
Q = t + to,  t)  — p/)*.  If  <I>  is  a basis  for  P with  P of  dimension  d,  then 
there  is  a d x d constant  matrix  B such  that  the  spectrum  of  eBo>  is  p and  an 
n x d co-periodic  matrix  C(r)  such  that  if  0 e P,  0 = <W>,  then 

T(t,  t)0  = C(t\emb. 

In  this  sense,  there  is  a Floquet  representation  on  the  generalized  eigen- 
space  P. 

From  Theorem  6.1,  if  expaDr  is  the  spectral  radius  of  TD(x  + to,  r),  it 
follows  that  any  point  p e <t(T(t  + co,  t))  with  |p|  > exp  aD x must  be  a 
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characteristic  multiplier.  In  particular,  if  D is  stable,  then  exp  aDx  < 1 and 
the  stability  properties  of  the  zero  solution  are  determined  by  the 
characteristic  multipliers. 

This  latter  remark  allows  one  to  prove  the  Fredholm  alternative  for 
nonhomogeneous  linear  systems  [17],  and  therefore  develop  a theory  of 
nonlinear  oscillations  for  weakly  nonlinear  equations.  The  Fredholm  alter- 
native is  also  a consequence  of  the  very  general  theory  of  two-point 
boundary-value  problems  in  [24]. 

Suppose  p is  a nonconstant  co-periodic  solution  of  an  autonomous 
NFDE  (D,f)e  :i6  x &k.  For  k > 1,  it  is  meaningful  to  consider  the  linear 
variational  equation  for  p, 

jt  Dy,  =fMt))yt  = >>,).  (8- 1 ) 

and  L(t,  (f>)  is  a>-periodic  in  t. 

From  Theorem  5.1,  p is  a C*+1  function.  Therefore,  D(p, ) = f(pt).  Differ- 
entiating this  expression  with  respect  to  t,  we  have  that  p is  a nontrivial 
periodic  solution  of  (8.1).  Consequently,  one  is  a characteristic  multiplier. 
One  should  now  be  able  to  extend  to  NFDEs  the  results  on  RFDEs 
concerning  orbital  stability  of  periodic  orbits  and  more  generally  the  behavior 
of  solutions  near  a periodic  orbit  [18,  38,  39].  Another  very  interesting 
result  that  should  be  extended  to  NFDEs  is  the  existence  of  “useful” 
coordinate  systems  near  a periodic  orbit  [40]. 

9.  Periodic  Solutions 

Consider  an  NFDE  (D,  f)  with  D(t,  </>), /(f,  <f>)  co-periodic  in  t,  co  > 0, 
and  D stable.  We  may  assume  from  Section  6 that  the  solution  operator 

TD,At)  = TDif(t,0) 

is  a weak  a-contraction  for  each  t > a.  Therefore,  an  immediate  consequence 
of  the  fixed-point  theorem  in  [25]  or  [7]  is  the  following 

Theorem  9.1.  If  there  exists  a closed  bounded  convex  set  T <=  C such 
that  {TD  /(r),  0 < f < cu}.  is  bounded  and  TD  /(co):  T -*  T,  then  there  is  an 
co-periodic  solution  of  the  NFDE  (D,f). 

Although  this  result  is  interesting,  it  is  not  very  useful  because  such  a 
convex  T almost  never  exists.  Usually,  one  is  only  able  to  prove  some  type 
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of  ultimate  boundedness  of  the  solution.  Fixed-point  theorems  sufficiently 
general  to  cover  these  situations  have  only  recently  been  proved  in  [20,  34]. 
To  describe  the  result,  we  say  that  the  NFDE  (D,f)  is  compact  dissipative 
if  there  is  a bounded  set  B c C such  that  for  any  compact  set  K c=  C there 
is  t = r (K)  such  that  TD  f(t)K  cz  B for  t > x. 

Theorem  9.2  [20].  If  the  NFDE  (D,f)  is  compact  dissipative,  then 
there  is  an  ^-periodic  solution. 

To  apply  Theorem  9.2,  one  must  have  a method  for  determining  when 
a system  is  compact  dissipative.  It  turns  out  that  the  use  of  Liapunov 
functionals  generally  puts  some  restrictions  on  the  delays  (see  [41]).  In  [28], 
sufficient  conditions  for  ultimate  boundedness  were  given  using  the 
Razumikhin-type  functions,  and  applications  were  made  to  transmission 
line  problems. 

The  development  of  fixed-point  theorems  in  [20]  was  motivated  directly 
from  the  application  to  NFDEs.  Other  fixed-point  theorems  are  in  this 
paper  and  an  application  to  the  existence  of  periodic  solutions  of  equations 
that  are  perturbations  of  one  with  an  asymptotically  stable  equilibrium 
point  is  given  in  [29].  The  proofs  in  [20]  also  were  the  motivation  for 
the  introduction  of  strongly  limit  compact  maps  in  [3], 
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Introduction 

The  purpose  of  this  chapter  is  to  present  recent  results  on  functional 
differential  equations  from  the  generic  point  of  view.  Although  most  results 
obtained  so  far  have  been  given  for  the  retarded  case,  it  seems  to  us  that 
neutral  equations  need  to  be  treated  soon.  Keeping  this  in  mind,  in 
Section  I a general  definition  is  given  for  autonomous  functional  differential 
equations  on  a manifold  M,  which  includes  retarded  and  neutral  cases. 
Examples  and  generic  results  extending  the  Kupka-Smale  theorem  for 
vector  fields  to  retarded  and  some  differential-delay  equations  are  pre- 
sented in  Section  II;  the  one-to-oneness  of  the  flow  for  retarded  equations, 
the  global  unstable  manifolds  of  hyperbolic  critical  points  and  periodic 
orbits,  and  the  invariance  of  nonwandering  sets  make  sense  generically 
and  are  also  considered  in  this  section.  Generic  linear  nonautonomous 
equations  have  one-to-oneness  in  the  solution  operator  and  appear  in 
Section  III. 


I.  Autonomous  Functional  Differential  Equations 

Let  M be  a C*  (or  analytic)  finite-dimensional  manifold,  / the  closed 
interval  [ — r,  0],  r > 0,  and  C°(L  M)  the  Banach  manifold  of  all  continuous 
maps  0 of  / into  M.  The  differentiable  structure  of  C°(/,  M),  the  smoothness 
of  the  evaluation  map  p : tf)  -*  0(0),  and  other  related  properties  can  be  found 
in  [12],  We  follow  closely  the  notation  used  in  [1]  for  calculus  on  manifolds. 

We  will  now  give  a geometric  definition  for  an  autonomous  functional 
differential  equation  in  a way  that  includes  some  important  and  well-known 
cases. 

* Research  supported  in  part  by  FAPESP,  Fundavao  de  Amparo  a Pesquisa  do  Estado 
de  Sao  Paulo.  Brazil.  Proc.  11-74/603. 
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An  autonomous  functional  differential  equation  (FDE)  on  M is  a pair  (D,  F) 
of  continuous  functions 

F:  C°(I,  M)  -* TM , D:  C°(l,  M)  - M, 

such  that  r M F = D.  Here  TM  denotes  the  tangent  bundle  of  M and  tm  is 
the  C00  (or  analytic)  canonical  projection  of  TM  onto  M.  Locally  one  can 
think  of  TM  as  a product;  then  for  each  (p  e C°(I,  M), 

F(4>)  = (D(ct>)J(<t>)). 

A solution  oJ  (D , F)  with  initial  condition  (p  at  t0  is  a continuous  function 
x(t)  defined  on  t0  — r < t < t0  + A,  0 < A < oo,  with  values  on  M,  such  that 
if  x,  e C°(I,  M)  is  defined  by  x,(0)  = x(t  + 0),  d e /,  t0  < t < t0  4-  A,  one  has 

(i)  xt0  = 0; 

(ii)  D(x()  is  a C1  function  for  re  [f0,  t0  + A)\ 

(iii)  d(D(x,))/dt  = F(x,)  for  all  t e[t0,  t0  + A),  where  J(D(x,))/dt  denotes 
the  tangent  vector  to  the  curve  £>(x()  at  the  point  t. 

Locally  one  can  write  on  TM: 

[D(x,),  d(D{x,))/dt]  = [D(x,),  f(x,)]  or  d(D(x,))/dt  =f(x,). 

The  solution  x(f)  is  sometimes  denoted  by  x(r;  r0,  (p ) and  x,  by  x,(f0,  (p) 
or  x,((p)  if  f0  = 0. 

A C‘  function  D:  C°(I,  M)  ->  M is  said  to  be  atomic  at  zero  if  for  any  (p 
in  C°(/,  M)  the  derivative  T^D  given  by 

T,D(^)=  f\(0,  fl)V(fl) 

* — r 

has  the  continuous  jump  A((p)  — [rj((p,  0)  — q((p,  0_)]  nonsingular,  say 
det  A(<p)  * 0 [6,  p.  279], 

An  autonomous  neutral  functional  differential  equation  (NFDE)  is  an 
FDE  (Z),  F)  with  D atomic  at  zero.  When  D is  the  evaluation  map  p,  the 
pair  (p,  F)  is  called  a retarded  functional  differential  equation  (RFDE)  (see 
[12,  Def.  1]).  Note  that  any  RFDE  is  an  NFDE.  In  fact,  we  only  need 
to  prove  that  p is  atomic  at  zero;  but 

T0p(T)  = T(O)=  |'\(0,  0)V(0), 

J -r 
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where  rj((f).  0)  = 0 for  0 e [ — r,  0)  and  >?(<£,  0)  = /.  This  shows  that  in  this  case 
A(4>)  = I 

If  (D.  F)  is  an  NFDE  with  F continuous  and  locall>  Lipschitzian.  it  can 
he  proved  that  one  obtains  existence,  uniqueness,  and  continuous  dependence 
of  solutions;  when  M = Rn  most  of  the  results  on  existence,  uniqueness, 
and  continuous  dependence  can  be  seen  in  [7],  For  the  case  of  an  RFDE 
on  a manifold  U some  of  these  questions  were  considered  in  [12]  and  the 
case  V/  - Rn  is  exhaustively  discussed  in  [4],  We  will  return  to  this  subject 
later  in  this  section. 

An  equilibrium  ( or  critical ) point  of  an  FDE  (D.  F)  is  a constant  function 
c e C°(/,  M)  such  that  F(c)  = 0.  A function  q(t ),  - x < t < + x,  is  said 
to  be  a solution  of  (D,  F)  on  ( — x,  + x)  if  for  every  a e ( — x,  + x),  the 
solution  x(t;  a,  qa)  of  ( D . F)  exists  and  satisfies  x,(g,  q„)  = gt,  t > g.  As  in 
the  case  of  ordinary  differential  equations,  the  constant  and  periodic  solutions 
q(t)  of  {D,  F)  on  (— x,  -l-x)  are  of  fundamental  importance  in  the 
qualitative  theory  of  functional  differential  equations. 

Many  questions  can  be  considered  when  we  try  to  understand  the 
qualitative  behavior  of  the  solutions  of  FDEs  with  respect  to  the  variation 
of  the  pairs  (D,  F)  in  some  topological  space.  The  closed  condition  r M F = D 
shows  that  the  smoothness  of  D is  at  least  equal  to  the  smoothness  of  F. 

The  generic  theory  for  RFDEs  on  compact  manifolds  has  been  developed 
considering  F in  the  Banach  space  M)  of  all  CK  maps  that  are 

bounded  with  bounded  derivatives  up  to  the  order  k > 1.  In  the  neutral 
case,  to  each  fixed  D corresponds  the  Banach  space  of  all  CK  maps 

F:  C°(T  M)  -*  TM 

such  that  r M ■ F = D and  that  are  bounded  with  bounded  derivatives  up  to 
the  order  k > 1. 

In  the  case  M = Rn  the  FDEs  can  be  written  as 

dD(x,)/dt  =/(xf). 

It  now  makes  sense  to  consider  special  NFDEs  as  pairs  (D,  f)e  'A  x FK, 
with  D atomic  at  zero,  being  the  Banach  space  of  the  bounded  linear 
operators  from  C = C°(I,  Rn)  into  Rn  with  the  usual  norm  and  FK  the  Banach 
space  of  all  CK  maps  from  H (the  closure  of  an  open  set  Q of  C)  into  Rn 
that  are  bounded  with  bounded  derivatives  up  to  the  order  k > I (see  [6, 
p.  279]). 
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II.  Autonomous  Retarded  Functional  Differential  Equations 

We  will  start  this  section  with  some  examples  of  RFDEs  on  a C' 
manifold  M. 

Example  1.  Any  CK-vector  field  on  M defines  a C"-RFDE  on  M.  k > 1. 
If  X : M -+  TM  is  a CK- vector  field  on  M , it  is  easy  to  see  that  F = X p 
is  a CK-RFDE  on  M.  All  solutions  of  X are  also  solutions  of  F.  The 
solutions  of  F on  (— x,  +x)  are  solutions  of  X\  in  particular,  constant 
and  periodic  solutions  are  the  same  for  X and  F. 

Example  2.  Any  CK-vector  field  on  C°(I,  M)  defines  a C"-RFDE  on  M, 
k > 1.  If  V:  C°(/,  M)~*  TC°(I,  M)  = C°(/,  TM)  is  a C'-vector  field  on 
C°(/,  M ),  then  F = Tp  • V is  a CK-RFDE  on  M,  since  tm  ■ Tp  = p ■ zcv. 
and  then  xM  • F = zM  • (Tp  ■ V)  = p. 

Example  3.  Fix  k > 1.  If  y:  C°(/,  M)-*  R is  a CK-real-valued  function 
and  X : M -»  TM  a CK-vector  field  on  M , then  F given  by 

F(0)  = y(0)  ■ *(0(0)) 

is  a C*-RFDE  on  M.  The  constant  solutions  (critical  points)  of  * are 
constant  solutions  of  F.  In  the  special  case  y(0)  = 1 +J°r02(0)d0,  the 
critical  points  of  F correspond  to  the  critical  points  of  *. 

Example  4.  Equation  of  first  variation.  Fix  k > 1 and  let  F : C°(/,  M)  -» 
TM  be  a CK  + '-RFDE  on  M.  The  double  tangent  space  of  M,  T2M , admits 
a canonical  involution  to:  T2M  -+  T2M,o2  = idT2M,  and  following  [1,  p.  18] 
o)  is  a C‘  -diffeomorphism  of  T2M  such  that  rTM  • w = Ttm  and  TxM  ■ oj  = 
rTM.  Using  the  above  relations  one  can  see  that  the  map 

oj  ■ TF:  C°(I,  TM)  = TC°(E  M)  -►  T2M 

is  a C*-RFDE  on  TM  called  the  first  variation  equation  associated  to  F. 
Locally,  if  (x(t),  y(t))e  TM  is  a solution  of  w • TF  and  F(0)  = (0(0), /(0)), 
one  can  write 

x(t)=f(x,),  y{t)  = Tf(x,)y,. 

Example  5.  Second-order  retarded  functional  differential  equations  on  M". 
The  double  tangent  space  T2M  of  M"  contains  a 3n-dimensional  sub- 
manifold J2  such  that  an  element  v of  T2M  belongs  to  J2  if  and  only  if 


FUNCTIONAL  DIFFERENTIAL  EQUATIONS— GENERIC  THEORY 


199 


oj(v)  = v,  oj  being  the  canonical  involution.  It  is  clear  that  zTS1  and  TxM 
coincide  on  J2.  A second-order  RFDE  on  M is  a Ck-RFDE  G:  C°(I , TM ) -» 
T2M  such  that  G has  values  on  J2.  Locally,  if  (x(f),  11(f))  e TM  is  a solution 
of  G then 

x = v,  v = g(x,,v,). 

In  such  a problem  we  look  for  basic  solutions  [curves  x(t)  on  M]  that 
obviously  satisfy  x = g(xt,  x,),  t > r. 

Example  6.  Consider  an  imbedding  of  a compact  manifold  M as  a closed 
submanifold  of  R \ for  a large  enough  N,  and  let  U be  a tubular  neighborhood 
of  M,  V an  open  set  of  RN,  and  a:  U -*  M the  C* -canonical  projection. 
The  maps 

q,  p:  C°(l,  M)  -»  TM 

defined  by 

P(<t>)  = T*"0t  [<M-  0]  and  q(<f>)  = 7a«0)ty(0)] 

are  C'-RFDEs.  Here  <p( 0)  and  4>(  — r)  are  considered  tangent  vectors  of  U 
at  the  point  0(0)  e M. 

Example  7.  Differential-delay  equations.  Consider  a CK-function  */: 
M x M ->TM  such  that  rMq  ~ M x M -*  M being  the  first  canonical 

projection.  The  map  g defines  a O-RFDE  F given  by  F(0)  = y(<p( 0),  0(  — r)). 
Locally  if  x(t)  is  a solution  of  this  equation  for  F,  one  obtains  x(t)  = 
g(x(t),x(t  - r)).  An  analogous  geometric  definition  can  be  given  for  equations 
of  the  type  x = 0(x(r),  x(f  - r,) x(f  - r)). 

Example  8.  An  RFDE  on  Rn.  Linear  RFDE.  Any  C'-function 
F:  C°(L  Rn)-+Rn  x Rn  of  the  form  F(0)  = (0(0), /(0))  is  a CK-RFDE  on 
Rn.  It  is  clear  that,  in  this  particular  case,  we  only  need  to  give  the 
CK-function  /:  C°(/,  /?")-»  Rn.  If  x(f)  is  a solution,  one  obtains  x =/(xf). 
In  this  case,  C°(/,  Rn)  is  a Banach  space;  then  it  makes  sense  to  consider 
linear  equations,  viz.,  choose  / as  a linear  continuous  map.  Examples  of 
RFDEs  on  R"  can  be  found  in  [4], 

Example  9.  Assume  that  for  a given  RFDE  on  Rn , x =/(x,),  there 
exists  a submanifold  S Rn  that  is  invariant  with  respect  to  the  equation, 
in  the  sense  that  for  any  0 € C°( / , Rn),  0(0)  e S,  0 e /,  the  solution  x(f ; 0;  0) 
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remains  on  S for  all  t > 0.  It  is  easy  to  see  that  the  restriction  of  / to 
C°(/,  S)  defines  the  following  RFDE  on  S: 

F:  C°(/,  S)  -*  TS 

by  F(0)  = (0(0), /(</>)).  The  next  two  examples  will  be  special  cases  of  this 
situation. 

Example  10.  An  RFDE  on  the  sphere  S2.  Let  M = S2  be  the  set  of 
all  (x,  y,  z)e  R 3 such  that  x2  + y2  + z2  = 1.  Consider  in  R 3 the  following 
differential-delay  system  of  equations: 

x = -x{t  - l)y(f)  - z(t ), 
y = x(t  - l)x(r)  - z(r), 
z = x(r)  + y(t). 

If  (x(r),  y(t),  z(r))  is  a solution  of  such  a system  it  is  easy  to  see  that 
xx  + yy  + zz  = 0 for  all  t > 0. 

Then  x2(f)  + y2(f ) + z2(f ) = a 2,  and  then,  if  an  initial  condition  0 is  such  that 
(f)(0)  e S 2 for  all  0 e [ - 1,  0],  one  concludes  that  the  solution  x(r;  0,  0)  has 
values  on  S2  for  all  t > 0.  The  above  considerations  show  that  F is  a well- 
defined  CX-RFDE  on  S2.  The  critical  points  of  F are 

» = «>/2-  l)1'2,  — L/2-  l),,J.v/2-  1) 

and 

v=(-(s/2-  I)1'2.  (^2  - I)1'2,  s/2  - 1). 

Example  11.  An  RFDE  on  the  circle  S'.  A similar  procedure  shows 
that  the  system  in  R2 

x = - x(r  - 1 )y(f ),  >’  = x(t  - 1 )x(r), 

induces  a C'-RFDE  on  the  set  S1  of  all  (x,  y)e  R2  such  that  x2  + y2  = 1. 
The  critical  points  in  this  case  are  o = (0,  1)  and  v = (0,  - 1). 

Example  12.  This  example  is  a combination  of  Examples  1 and  6. 
Take  in  S2  the  vector  field  Y given  by  an  infinitesimal  rotation  around 
the  vertical  z-axis.  Let  r = n/2  and  F be  given  by  F(0)  = 2V(0(O))  + p(0). 
Assume  that  if  a point  xeS2  belongs  to  the  equator,  |y(x)|=  1.  The 
critical  points  of  F are  ±v  = (0,  0,  1)  and  (cos  2 1,  sin  2 1,  0)  is  a periodic 
solution. 
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After  the  above  examples  we  will  try  to  describe  for  the  RFDE  on  a 
compact  manifold  M the  existence  ami  uniqueness  of  solutions  and  the 
behavior  of  them  with  respect  to  initial  data  and  to  the  equation. 

For  simplicity  of  notation  we  will  denote  C = C°(I , M)  and  .'/'K  = 
M)  for  a fixed  k > 1. 

For  F e ./'K  and  0 e C,  consider  the  following  problem  of  existence  and 
uniqueness  of  solution: 

x(t)  = F(x,),  t > t0,  x,o  = 0. 

The  answer  is  that  there  exists  a unique  solution  x(t)  with  such  an  initial 
condition  0 at  t = t0  ; the  function  x(t)  = x(f;  f0 , 0)  is  defined  on  t0  - r < 
t < + oc  and  has  values  on  M for  all  t > (t0  — r)  (see  [10]  and  [12]).  If  the 
continuous  function  F is  only  supposed  to  be  locally  Lipschitzian  in  0 6 C, 
the  same  result  is  true  [12]  (recall  that  M is  compact). 

Following  [10]  define  now  the  semiflow  map  of  F e .¥* 

<I>:  ,fK  x C x [0,  x ) -»  C, 

where  <I>(F,  0,  t)  — x,  = x,(0).  It  can  be  proved  that 

(a)  <t>  is  continuous  on  ,fK  xC  x [0,  oc), 

(b)  for  each  t > 0,  is  CK  on  x C x {r[, 

(c)  for  any  s,  0 < ,s  < k,  is  Cs  on  ./'*  x C x (,vr,  x). 

Now  fix  F e ./'K  and  define  for  each  r >0  the  CK-map  <I>(:  C -*  C by 
0,(0)  = x,(0).  Then 

<&/,  + r2  = <*>/,  • O/,  = ' <*>1,  and  4>0  = idC. 

The  solution  operator  O,  is  also  called  the  flow  of  F,Q>,  need  not  be  one-to-one, 
but  if  0 5*  T and  <t>(i(0)  = ^(T)  then 

<t>,l+M)  = <t,i2  + n('Y)  ^ all  <r>0. 

For  the  equations  given  by  Example  1 the  solution  operator  is  never  one- 
to-one  for  t = r. 

Example  13.  Consider  the  restriction  to  S1  of  the  following  system  of 
differential  delay  equations: 

x = -x(t  - l).v(r)(l  - x(t)),  y = x(t  - l)x(f)(l  - x(t)). 

The  critical  points  in  this  case  are  ±n  = (0,  1)  and  v = (1,  0).  The  initial 
conditions  /0  = 0 and  (f)(0)  = (cos  (n/2)0,  sin  (n/2)0),  0 e [-  1,  0]  define  a 
unique  solution  x(f;  0,  0)  that  for  / > 0 is  the  constant  map  (1,  0).  The 
solution  operator  O,  is  not  one-to-one  on  C°(/,  S1). 
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Example  14.  Take  M = R and  consider  the  scalar  equation  x = 
x(r  — 1).  It  is  easy  to  see  that  the  solution  operator  is  one-to-one  in  this  case. 

An  important  property  of  the  solution  operator  <J>,  is  that  for  t > r,  d>,  is 
a compact  map  [i.e.,  d>,  maps  bounded  subsets  of  C°(/,  M)  into  relatively 
compact  subsets  of  C°(/,  M)].  This  is  a consequence  of  the  boundedness  of 
F in  M)  and  the  classical  Arzela  theorem.  Then  even  if  0,  is  one- 

to-one,  it  can  never  be  a homeomorphism. 

The  behavior  at  infinity  of  the  solutions  is  another  very  important 
qualitative  concept.  An  invariant  set  of  a given  RFDE  F is  any  subset  S of 
C°(/,  M)  such  that  for  every  <p  e S,  there  is  a solution  x on  (-oo,  -i-oo) 
with  x0  = 0 and  x,eS  for  all  f e ( — oo,  +oo).  It  is  clear  that  the  set  A(F) 
of  all  x,  of  all  solutions  x(f)  on  ( — oo,  -I-  oo)  is  the  largest  invariant  set  of  F. 
When  A/  is  compact  A(F)  is  compact  in  C°(/,  M)  and 

m = n *„<  c°(/.  Mj). 

J = 1 

The  set  A(F)  is  called  the  attractor  of  F.  An  element  ¥ e C°(I,  M)  is  said 
to  be  in  the  co-limit  set  a>(0 ) of  an  orbit  y+(0)  = *,(0)  through  0 if 

there  is  a sequence  of  t„  -*  oo  as  n -*•  oo  such  that  x,r  -»  ¥ as  n -»  oo.  An 
element  ¥ e C°(/,  M)  is  said  to  be  in  the  a- limit  set  a (0)  of  an  orbit  through 
0 if  there  is  a solution  x(r)  on  (-oo,  +oo),  xo(0)  = 0,  and  a sequence 
tn  ->  — oo  as  n -*  oo  such  that  x,(0)  -*  ¥ as  n -*  oo.  The  following  result  is 
proved  in  [4]:  any  w-limit  set  w(0)  is  nonempty,  compact,  connected,  and 
invariant  [then  contained  in  A(F)\.  Again  we  need  the  boundedness  of 
F e .##■*(/,  M)  and  M.  Also  if  0 e A(F)  and  if  <D,  is  one-to-one  on  A(F) 
then  a(0)  is  nonempty,  compact,  connected,  and  invariant. 

A point  ¥ e A(F)  is  a nonwandering  point  of  F if  for  any  neighborhood  U 
of  ¥ in  A(F)  and  any  real  number  T > 0 there  exists  t = t(U,  T)>  T and 
'Pel/  such  that  0,(¥)  e U.  The  nonwandering  set  ft(F)  is  the  set  of  all  non- 
wandering points  of  F.  If  0 e A(F),  then  c o(<p)  £ fi (f)  and  a (0)  ^ Q(F).  The 
nonwandering  set  C1(F)  is  closed  and  if  O,  is  one-to-one  on  A(F)  then  fi (F) 
is  an  invariant  set.  As  in  the  ordinary  case,  points  in  fi(F)  can  exist  that 
do  not  belong  to  the  to - or  a-limit  sets  of  0 e C°(/,  M). 

Example  15.  Consider  in  a vertical  torus  T 2 the  Hamiltonian  vector 
field  X of  its  height  function  (see  [15]);  the  orbits  of  X belong  to  the 
intersections  of  T 2 with  horizontal  planes.  This  vector  field  has  four  critical 
points  (two  centers  and  two  saddles).  Use  Example  1 to  construct  the  RFDE 
X • p on  T2.  Any  nonconstant  solution  x(f)  of  X in  the  level  of  a saddle 
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is  a solution  of  X ■ p such  that  x,  is  a nonwandering  point  and  does  not 
belong  to  the  a>-  or  a-limit  set  of  elements  (p  e C°(/,  M). 

Given  a critical  point  cp  of  an  RFDE  F we  get  d>,(</>)  = (p  for  all  t > 0. 
The  derivative  7^  0>(  of  0>,  at  the  point  (p  (for  fixed  t > 0)  is  a linear  continuous 
map 

7;<D,:  7;  C°(/,  M)  -*  7^  C°(7,  M), 

T#  C°(/,  M)  being  the  tangent  space  of  the  manifold  C°(/,  M)  at  the  point  </>. 
This  derivative  7^,0),  is  given  by 

T0<Dr(4>)  = zt(n 

where  ¥ e 7^C°(/,  M)  and  zt(¥)  is  the  flow  of  cu  • TF , the  equation  of  first 
variation  of  E (see  Example  4).  The  critical  point  <p  is  said  to  be  non- 
degenerate or  transversal  (see  [10,  12])  if  1 does  not  belong  to  the  spectrum 
<7(7^0>,)  of  the  operator  0>,;  and  <p  is  said  to  be  elementary  or  hyperbolic 
if  *(  7*<r>,)  n S'  / 0>,  where  S1  is  the  unit  circle  in  the  complex  plane.  In  [4] 
some  properties  of  the  compact  operator  7^< I>r  are  described  (see  also  [9]  and 
[4,  Chapter  26]).  A hyperbolic  critical  point  has  the  saddle  point  property. 
The  solutions  defined  for  all  f < 0 and  remaining  near  <p  form  the  finite- 
dimensional local  unstable  manifold ; the  solutions  remaining  near  (p  for  all 
t > 0 form  the  finite  codimensional  local  stable  manifold.  The  critical  point  (p 
is  the  intersection  of  these  transversal  smooth  manifolds  and  their  tangent 
spaces  split  at  </>. 

Since  the  flow  <D,  is,  in  general,  not  one-to-one,  it  is  not  clear  if  we  can 
talk  about  a global  unstable  manifold  of  (p.  But  if  <J>,  is  one-to-one  on  the 
attractor  A(F)  and  7^<I>,  is  also  one-to-one  when  restricted  to  the  tangent 
vectors  of  the  local  unstable  manifold  of  the  point  (p , then  the  global  unstable 
manifold  of  (p  is  smooth. 

As  in  the  case  of  critical  points,  we  can  take  a nonconstant  T-periodic 
solution  x(f)  of  F (T  > 0 is  any  period)  and  if  <p  = x0  we  get  <t>T(<p ) = (p\ 
choose  T sufficiently  large  in  order  to  have  T > r,  and  again  the  map 

7; ay.  t+c°(i,  m)^t„c0(i,  m) 

is  a compact  operator  [4,  p.  196],  The  description  of  the  spectrum  <r(7^d>7) 
is  also  given  in  [4],  It  is  easy  to  see  that 

T+QAxo)  = *o  = [dxfdt]  (=0, 

so  1 belongs  to  <r(  T#  0>7  ).  The  period  T is  called  a transversal  period  or  the 
periodic  solution  is  called  nondegenerate  [10,  13]  if  1 has  multiplicity  one 
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(1  is  a simple  multiplier  [2,  3]).  If  x(r)  is  nondegenerate  and  er( 7^,  <I>r)  nS1  = 
{1},  then  x(t)  is  said  to  be  an  elementary  or  hyperbolic  periodic  solution. 
The  concepts  of  transversality  and  hyperbolicity  depend  only  on  the  orbit 
T = {.xf  |r  e R}  and  not  on  the  point  0 that  we  considered. 

Hale  [3]  showed  that  in  a neighborhood  of  the  orbit  of  a hyperbolic 
periodic  solution  there  are  local  stable  and  unstable  manifolds  as  a kind  of 
saddle  property. 

The  behavior  of  solutions  near  a hyperbolic  periodic  orbit  was  considered 
by  Stokes  [17]  in  the  case  where  x(f)  has  all  multipliers  different  from  1 with 
modulus  less  than  one.  In  some  sense  he  showed  the  existence  of  the  local 
stable  manifold  (in  this  case  the  orbit  is  the  unstable  manifold)  and  described 
the  asymptotic  behavior  of  the  solutions  near  T.  He  also  considered  the  stable 
manifold  for  a periodic  orbit  T in  the  case  where  there  exist  multipliers 
different  from  1 in  S'  but  with  (what  he  called)  simple  elementary  divisors. 
Later  Hale  [3]  showed  that  the  hyperbolicity  condition  implies  that  the  orbit 
T exhibits  a saddle  structure  in  the  sense  that  the  unstable  manifold  is  locally 
diffeomorphic  either  to  the  unstable  manifold  in  the  equation  of  first  varia- 
tion crossed  with  a circle  or  a generalized  Mobius  band,  and  the  stable 
manifold  is  a special  union  of  sets  each  of  which  are  locally  homeomorphic 
to  the  asymptotically  stable  manifold  in  the  equation  of  first  variation. 
Under  the  weaker  hypothesis  that  T is  nondegenerate,  Hale  also  showed  that 
there  is  a periodic  solution  of  a system  that  is  a perturbation  of  the  one 
given  provided  that  the  perturbation  is  small. 

It  is  quite  evident  that  we  need  to  extend  the  techniques  for  obtaining 
the  qualitative  behavior  of  solutions  near  other  invariant  sets.  A good 
definition  of  hyperbolicity  for  the  nonwandering  set  fi(F)  and  general 
theorems  analogous  to  Smale’s  spectral  decomposition  theorem  for  difTeo- 
morphisms  [16]  will  certainly  give  a good  description  of  the  behavior  of 
solutions  at  infinity. 

GENERIC  RESULTS  FOR  RFDE 

In  the  case  where  M is  a compact  manifold,  let  G,"  be  the  set  of  all 
elements  in  M)  such  that  all  critical  points  are  nondegenerate  (case 

/ = 0)  or  hyperbolic  (case  i = 1 ).  The  first  generic  result  says  that  G,K  is  open 
and  dense  in  ,^.fK(/,  M),  k > 1,  i = 0,  1 (see  [12]). 

Let  us  denote  by  GAK(a)  the  set  of  all  F e G,K  such  that  all  nonconstant 
periodic  solutions  of  F with  period  T e (0,  a]  have  T as  a transversal  period; 
and  let  G2*(a)  be  the  set  of  all  F e G,"  such  that  all  nonconstant  periodic 
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solutions  of  £ with  period  T e (0,  a]  are  hyperbolic.  In  [13]  it  is  proved 
that  GAK(u)  and  G2K(a)  are  open  sets  of  .#./'*(/,  M ),  k > 1.  The  proof  of 
the  openness  of  G2K(a)  was  given  by  the  author  together  with  Hale,  following 
the  same  lines  of  Peixoto’s  version  of  the  Kupka-Smale  theorem  [14]. 
Peixoto  uses  a classical  Hartman  theorem  for  ordinary  differential  equations, 
but  this  can  be  avoided  in  the  retarded  case.  The  proof  of  the  openness 
of  GAK(u)  was  based  on  a transversal  openness  theorem  [1,  p.  47]. 

For  the  case  M = Rn,  Mallet-Paret  [10]  considers  the  Banach  space 
of  C*-functions  /:  C = C°(/,  R")  ->  R"  such  that  all  derivatives  of  /of  order 
at  most  k are  bounded  on  C;  for  technical  reasons  related  to  the  utilization 
of  the  Sard-Smale  theorem,  he  required  k >2.  Calling  f&2  ^ VC  the  set  of 
all  / g 3C  such  that  all  critical  points  and  all  periodic  solutions  of  x = f(xt) 
are  hyperbolic,  Mallet-Paret  proved  that  fS2  is  a residual  subset  of  For 
doing  that  he  defines  four  classes  of  subsets  of  9C,  and  proves  that  they  are 
open;  for  E £ Rn  compact,  and  A e (0,  oo): 

'S0 (E)  {/ g SC\a\\  critical  points  a e £ are  transversal}; 

#i(E)  {/g  .^lall  critical  points  a g E are  hyperbolic}; 

#3/2(£,  A)  {/g  C^!(£)|all  nonconstant  periodic  solutions  of /lying  in  £ 
and  with  period  T e (0,  A]  have  T as  a transversal 
period}; 

^2(£,  A)  {/g  C#,(£)|  all  nonconstant  periodic  solutions  of/ lying  in  £ 
and  with  period  T e (0,  A]  are  hyperbolic}. 

With  a very  interesting  and  nontrivial  proof  he  showed  that  C^2(£,  A)  is  open 
and  dense  in  Then,  taking  sequences  Em  whose  union  is  Rn,  and  Am  -*  oo 
as  m -*  oo,  he  found  that  tf2  = Q*=1  ^2(£m,  Am ) is  residual  in  ?I.  Mallet- 
Paret  also  told  the  author  in  a private  communication  that  he  can  achieve 
the  same  result  by  staying  in  the  subset  (J  of  3f  of  all  differential-delay 
equations  of  the  type  x = ^(x(l),  x(f  — 1)).  In  this  case 

9 = {/e  X\ m = g(4>( 0),  <H~  I));  C}. 

An  open  question  is  to  prove  the  same  result  staying  on  the  set 

^ = i/^1/W  = #(-i));^c}. 

Call  the  set  of  all  / g fS2  such  that  the  stable  and  unstable  manifolds 
of  all  critical  points  and  periodic  solutions  are  transversal.  The  following 
conjecture  could  be  an  extension  of  the  Kupka-Smale  theorem : is  residual. 

It  seems  also  reasonable  that  following  [10]  one  can  get  G2K(a ) dense  in 
M),  for  compact  M,  k >2.  Is  the  Kupka-Smale  result  true  for  an 
RFDE  on  a compact  manifold? 
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Another  natural  question  is  to  introduce  a definition  of  local  equivalence 
between  two  RFDEs  in  the  neighborhood  of  a hyperbolic  critical  point  and 
try  to  prove  the  local  equivalence  between  a given  RFDE  and  its  linear  part. 
This  corresponds  to  a kind  of  Hartman  theorem.  This  and  many  other 
questions  can  be  considered  in  the  qualitative  and  generic  theory  of  RFDE 
with  regard  to  the  nice  corresponding  theories  for  ordinary  differential 
equations. 


BACKWARD  CONTINUATION  AND  ONE-TO-ONENESS 


In  [4,  Theorem  6.1]  it  is  proved  that  if  the  initial  condition  </>  e C°(/,  M) 
of  the  solution  x(f;  0,  (f>)  of  an  RFDE  F is  C1  in  a neighborhood  of  (0_), 
satisfies  <^>(0)  = F((f>),  and  F verifies  properties  (a)  and  (b)  below,  then  the 
solution  has  a small  unique  backward  continuation.  To  write  conditions 
(a)  and  (b)  we  need  to  describe  F locally,  say,  in  a neighborhood  U of 
</>  e C°(/,  M),  F is  given  by  F((f>)  = ((f)(0),  f (cp))  and 

T4f('V)=  f drj(cf,,  0)V(0). 

J -r 


(a)  F is  atomic  at  ( — r)  at  </>,  which  means 

det  .#{<f>)  = det  (ri(<f),  -r)  - rj((f),  -r+))  ^ 0. 

(b)  F has  smoothness  on  the  measure,  which  means  the  existence  of  a 
continuous  function  y((f>,  s),  <f>  e V,  s > 0,  y(<f),  0)  = 0,  such  that 


J -r 


< y((f>,  s)  sup  |4/(0)|, 


9 € [— r,  -r  + s]. 


It  can  be  proved  that  any  C'-RFDE  F has  smoothness  on  the  measure. 
The  only  sufficient  condition  to  be  verified  for  the  uniqueness  of  the  back- 
ward continuation  is  that  F must  be  atomic  at  — r at  the  given  (f>. 

If  F e .4i:f*(l,  M ) is  atomic  at  — r at  all  </>  in  some  compact  invariant 
subset  S of  C°(/,  M ),  the  flow  d>(,  t > 0,  restricted  to  S will  be  one-to-one 
and  then  a homeomorphism  of  S onto  S.  It  was  conjectured  in  [5]  that  O, 
is  always  one-to-one  on  co-limit  sets;  the  conjecture  is  true  for  F analytic, 
because  in  this  case  the  solutions  defined  on  ( — oo,  +00)  are  bounded 
(M  compact),  thereby  analytic  [11]. 
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Example  16.  Consider  the  scalar  equation 

*(')  =/(*»)  = M'  - Of- 

Since  x(t  — r)  = [x(t)]1/3,  the  flow  O,  is  one-to-one  on  C°(/,  R ),  t > 0,  but 
the  function  /is  not  atomic  at  — r at  </>  = 0;  in  fact, /(</>)  = [</>(  — r)]3  and 

7;/(T)  = 3[0(-r)]24>(-r). 

This  example  shows  that  to  be  atomic  at  — r at  any  4>  is  not  a necessary 
condition  for  one-to-oneness  of  the  flow. 

A very  important  question  is:  Under  what  conditions  is  the  attractor 
A(F ) a smooth  manifold?  When  M is  compact  and  F e A/),  k > 2, 

A(F ) as  a compact  (thereby  finite-dimensional)  manifold  implies  that  the 
solutions  x(t)  of  F on  ( — oo,  + oo)  will  be  given  by  a vector  field  on  /4(F); 
in  particular,  the  flow  of  this  vector  field  will  be  the  restriction  of  <D,  to 
/4(F),  and  thus  a difleomorphism. 

Note  that  if  the  compact  manifold  M is  analytic  and  F is  also  an  analytic 
RFDE  on  M,  all  solutions  on  A(F)  must  be  disjoint  and  the  solution 
operator  <D,  is  a homeomorphism  of  A(F)  onto  A(F).  The  nonwandering 
set  Q(F)  is  invariant  and  the  global  unstable  manifolds  of  hyperbolic  critical 
points  and  hyperbolic  nonconstant  periodic  orbits  are  analytic  finite- 
dimensional manifolds.  If  fl(F)  is  the  union  of  a finite  number  of  hyperbolic 
critical  points  with  a finite  number  of  hyperbolic  periodic  orbits,  the 
attractor  A(F)  is  the  union  of  all  (in  finite  number)  unstable  manifolds 
of  critical  points  and  periodic  orbits.  The  globally  defined  solutions  of  F 
are  then  solutions  of  a finite  number  of  vector  fields.  Is  /4(F)  a manifold 
in  this  case? 

The  procedure  of  approximating  a CK-RFDE  by  an  analytic  one  gives 
some  information  about  the  A(F),  since  in  the  analytic  case  the  flow  is 
one-to-one.  It  will  be  very  important  to  know  if,  at  least  generically,  A(F) 
is  a manifold.  Also  conditions  on  O(F)  need  to  be  considered  in  order  to 
get  a kind  of  continuity  for  the  maps  F -»  Q(F)  and  F -*  A(F). 

A Morse-Smale  RFDE  can  be  defined  as  retarded  equations  F for  which 
the  Q(F)  is  equal  to  the  union  of  a finite  number  of  critical  points  and 
periodic  orbits,  all  hyperbolic,  and  the  corresponding  stable  and  unstable 
manifolds  are  in  a transversal  position.  Is  the  set  of  all  Morse-Smale 
RFDEs  an  open  subset  of  A/)? 

The  equation  of  first  variation  relative  to  a solution  x(f)  of  a nonlinear 
equation  x = f(x,)  is  given  by  y = Tf(x,)y,  (see  Example  4).  The  one-to- 
oneness  of  its  flow  is  needed  to  define  globally  invariant  manifolds. 
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III.  Nonautonomous  Retarded  Functional  Differential  Equations 

All  the  definitions  we  need  to  consider  nonautonomous  retarded  t unctional 
differential  equations  of  the  type  x = f(t , xr)  can  be  found  in  [4],  It.  order 
to  point  out  some  recent  generic  results  of  Hale  and  the  author  [8]  we  will 
restrict  ourselves  to  the  linear  case.  Let  be  the  Banach  space  of  all  linear 
continuous  functions  A:  C°(I,  R")->Rn  with  the  usual  norm.  A linear 
retarded  functional  differential  equation  LRFDE  is  defined  by  any  function 
L:  R-*  '//  and  the  solutions  needed  to  satisfy 

x(i)  = L(t)xt . 

If  Cl(R , '//)  is  the  space  of  C'-LRFDEs  with  the  C‘-uniform  topology  on 
compact  sets  of  R,  it  is  proved  in  [8]  that  the  set  of  all  LRFDEs  such 
that  the  solution  operator  is  one-to-one  on  C°(l,  Rn ) is  a dense  subset  of 
C‘(R,  y/)  and  is  not  an  open  set.  For  a fixed-time  initial  condition,  they 
showed  that  the  set  of  all  LeCl(R , '//)  such  that  the  solution  operator 
is  one-to-one,  except  at  a countable  number  of  points  with  no  finite 
accumulation  point,  is  residual. 

Let  'y(R)  £ Cl(R,  y//)  the  set  of  all  L such  that  there  is  an  integer  N 
such  that  the  corresponding  measure  has  at  most  N discontinuities  in 
0 e [ — r,  0]  for  all  t e R and  is  a step  function  in  0.  For  K £ R compact, 
define  ry(K)  in  a similar  way.  Then  the  set  of  L in  £/(R)  for  which  the 
solution  operator  is  one-to-one  is  residual.  Also,  the  corresponding  set  is 
open  in  ry(K). 
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1.  Introduction 


I do  not  know  the  origin  of  the  terminology  “invariance  principle”  except 
that  I picked  it  up  from  others  around  1965.  The  principle  states  that,  if 
the  positive  limit  sets  of  a dynamical  system  have  an  invariance  property, 
then  Liapunov  functions  can  be  used  to  obtain  information  on  the  location 
of  positive  limits  sets.  The  application  of  this  principle  then  gives  a 
generalization  of  the  classical  Liapunov  theory  of  stability  and  instability. 
The  term  invariance  principle  has  also  been  used  at  times  to  refer  to  the 
results  obtained  by  the  application  of  the  principle.  (Propositions  1 and  2 
in  Section  2 are  general  invariance  principles.)  For  applications  it  has  the 
advantage  of  providing  more  general  results,  making  applications  easier, 
and  of  enlarging  the  class  of  useful  Liapunov  functions. 

This  exploitation  of  the  invariance  property  of  the  positive  limit  sets  of 
solutions  of  autonomous  ordinary  differential  equations  to  extend  and  unify 
Liapunov  stability  theory  can  be  dated  back  to  1960  [17,  18].  Barbashin 
and  Krassovski  in  1952  gave  a partial  result,  but  they  overlooked  the 
importance  of  invariance.  Primitive  forms  of  invariance  principles  can  be 
found  in  many  applications.  The  basic  ideas  are  simple  and  natural,  and 
perhaps  for  this  reason  invariance  principles  have  proved  to  be  useful  both 
for  theory  and  applications.  Applications  of  invariance  principles  have  now 
gone  beyond  autonomous  ordinary  differential  equations  and  are  known 
for  many  types  of  general  dynamical  systems.  It  would  be  rash  to  say  that 
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the  theory  is  complete.  In  spite  of  the  simplicity  of  the  basic  ideas,  dis- 
covering just  how.  to  associate  a dynamical  system  with  a particular  class 
of  functional  equations,  integral  equations,  partial  differential  equations,  or 
evolutionary  equations  can  be  difficult,  and  it  can  be  nontrivial  and  non- 
elementary  to  obtain  a theory  that  gives  direct  methods  for  studying 
stability  and  asymptotic  behavior. 

Among  the  people  who  have  contributed  to  these  developments  are 
LaSalle.  Hale.  Yoshizawa.  Miller.  Infante.  Sell.  Slemrod.  Dafermos.  Hurt. 
Cruz.  Chafee.  Wakeman.  and  Walker.  There  are  also  specific  applications 
too  numerous  to  mention.  (A  few  can  be  found  in  the  references). 

Originally.  I had  thought  I might  survey  some  of  these  more  recent 
developments  and  perhaps  discuss  some  specific  applications.  This  turns  out 
to  be  too  difficult  to  do  in  any  reasonable  fashion  within  space  limitations, 
and  there  are  many  papers  and  a number  of  surveys  easily  accessible. 

What  I have  decided  to  do  is  to  give  a simple  and  abstract  formulation 
that  shows  that  one  need  impose,  insofar  as  invariance  principles  and 
stability  theory  are  concerned,  very  little  topological  structure  on  the  state 
space.  This  brings  out  the  generality  of  the  invariance  principle  and 
illustrates  quite  well  the  simple  and  elementary  nature  of  the  basic  ideas. 
In  itself  the  abstraction  is  not  uninteresting,  and  stripping  the  theory  to 
its  bare  bones  makes  it  possible  to  see  clearly  the  role  of  certain  assumptions. 
At  the  same  time  the  structure  is  sufficient  for  discussing  the  difficulties 
involved  in  associating  dy  namical  systems  with  general  types  of  equations. 
The  difficulties  arise  in  obtaining  a theory  that  can  actually  be  applied  to 
the  solution  of  real  problems.  In  particular,  a general  formulation  is  given 
that  shows  the  relationship  between  the  work  of  Sell  ([38]  and  Miller  and 
Sell  [27.  29-32],  and  also  their  joint  paper  in  this  volume)  and  the  work 
of  Dafermos  [5-8]  for  associating  dynamical  systems  with  nonautonomous 
ordinary  differential  equations  and  integral  equations  and  processes  (non- 
autonomous flows)  in  general.  This  is  discussed  in  Section  3. 


2.  Abstract  Dynamical  Systems 

Here  the  state  space  will  be  a Frechet  space  (these  are  the  spaces  of 
class  (L)  studied  by  Frechet  in  his  dissertation  around  1906).  A set  P is  a 
Frechet  space  if  each  sequence  of  elements  pne  P either  has  a limit  pe  P 
associated  with  it  or  not.  If  it  has  a limit  p.  we  denote  this  by  pn  -*  p.  and 
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we  say  that  the  sequence  converges.  This  convergence  has  the  following 
properties: 

(1)  pn^p  and  pn^q  implies  p = q: 

(2)  pn  = p.  n = 1.2 implies  pn  -*  p:  and 

(3)  Pn  — * P implies  p,,  -»  p for  each  subsequence  pr:  of  pn . 

In  this  chapter.  I shall  always  mean  sequential  continuity,  sequential 
compactness,  etc.,  and  the  adjective  "sequential"  is  omitted  throughout 
Note  that  in  such  a space  P the  closure  of  a set  may  not  be  closed. 
However,  compact  sets  are  always  closed. 

Let  0:  [0.  cm)  -»  P (0  < cm  < x ).  Following  BirkhofT  [2],  q is  said  to  be  a 
positive  limit  point  of  <p  if  there  is  a sequence  tn  -*  cm  such  that  0(rn)  -*  q. 
Throughout  we  limit  ourselves  to  what  happens  as  t -*  cm  . and  from  this 
point  omit  everywhere  the  adjective  "positive."  (For  instance,  "invariance" 
will  always  mean  "positive  invariance.")  The  set  of  all  such  limit  points  will 
be  denoted  by  Q(0.  cm)  and  is  called  the  limit  set  of  0([O,  cm)).  We  say 
that  [0.  < m)  is  maximal  if  cm  = /.  or  Q(0,  cm)  is  empty  ([0.  cm)  is  the  maximal 
positive  domain  of  definition  of  0)  Throughout  P is  a Frechet  space. 

Definit  ion  1 [Dynamical  Systems  of  Type  (i)].  With  each  p e P there 
is  associated  an  interval  Up)  = [0.  cm(p)),  0 < oj(p)  < / . Let  / = '(/.  p):  pe  P. 
1 e Up) J.  A map  rt:  / -«  P is  called  a D, -system  on  P if 

A).  Relative  to  n(-.  p)  each  Up)  is  maximal. 

A 2 . 7t(0,  p)  = p. 

A3.  s,  t e R * = [0.  x)  and  s + tel(p)  implies  sel(n{t.  p))  and 

7t(.v  + 1.  p)  = 7t( s.  7t(t.  p)). 

A4 . n € C,  [some  continuity  conditions  on  Up)  and  n(t.  p)]. 

Systems  such  as  these  have  been  called  “local  semiflows"  or  "local 
dynamical  systems."  Usually  the  state  space  P is  given  more  topological 
structure  (see.  for  instance.  [29]  and  [ 1]). 

Immediate  consequences  of  A3  are 

P,  ,.  cm(p)  < t + cm( 7i( f . p))  for  all  t € l(p). 

Pi  2 f*  Up)  r l(q)  and  n(t.  p)  = nit.  q)  implies  n(t  -f  s.  p)  = n(t  + s,  q) 
for  all  s > 0 and  t +■  s e Up)  n Hq). 

Property  1.2  is  the  uniqueness  of  the  flow  in  the  positive  direction  of  t. 
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It  is  also  easy  to  see  that  A3  could  be  replaced  by 

A3*.  Given  p e P and  sel(p)  there  is  a q e P such  that  f>0  and 
s + te  Up)  implies  t e I(q ) and  n(t  + s,  p)  = n(t,  q). 

A3*  states  that  the  translate  of  a motion  is  a motion  [of  course,  q = n(s . p) 
for  a Dj-system]. 


Definition  2.  neCt  if  pn~>q  and  t e I(q)  imply  r e l(p„)  for  all  n 
sufficiently  large  and  n(t.  n„)  ->  n(t.  q). 

Thus  u 6 C,  means  lower  semicontinuity  of  l(p)  and  continuity  of  n(t.  p) 
with  respect  to  p for  fixed  t. 


Definition  3.  (a)  A set  M in  P is  said  to  be  weakly  invariant  (with 

respect  to  n)  if  n{I(p)-  p)  c M for  all  p e M. 

(b)  A set  M is  said  to  be  invariant  if  M is  weakly  invariant  and  l(p)  = R' 
for  each  p e M. 

(c)  We  say  that  7t(f,  p)  is  compact  if  n(Up).  p)  is  precompact  (contained 
in  a compact  set  of  P). 

For  Dr systems  it  is  easy  to  see  that  [Q(p)  is  the  limit  set  of  n(l(p)-  p)]'- 

P13.  Q(p)  is  weakly  invariant. 

P,  4.  If  n(t,  p)  is  compact,  then  Q{p)  is  nonempty,  precompact,  and 
invariant 

Even  though  n(t.  p)  is  compact,  it  may  be  that  Q(p)  is  not  closed,  and 
ii(p)  is  closed  if  and  only  if  it  is  compact.  Also,  it  is  not  difficult  to 
construct  a D, -system,  where  a j(p)  < t + a)  (n(t.  p))  for  some  t e Up )■ 

We  shall  now  introduce  a general  concept  of  a Liapunov  function  for  a 
D, -system  n and  show  for  D, -systems  that  Liapunov  functions  give  informa- 
tion about  the  location  of  limit  sets.  (See  Proposition  1.)  This  establishes 
the  invariance  principle  in  this  general  and  abstract  setting.  How  well  the 
Liapunov  function  locates  limit  sets  depends  on  the  Liapunov  function 
itself,  and  the  difficulty  in  applications  is  to  find  "good  " Liapunov  functions. 
For  instance,  a constant  function  is  a Liapunov  function  but  provides  no 
useful  information. 

Let  n be  a Dr system  on  P.  Throughout.  G denotes  an  arbitrary  subset 
of  P and  V is  a real-valued  function  on  P.  We  say  that  V is  a Liapunov 
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Junction  for  n on  G if  (i)  V is  continuous  on  (7.  and  (ii)  7r([0,  //].  p)  a G 
implies  l'(7r(r.  p))  is  nonincreasing  with  respect  to  t on  [0.  fi).  Let 

K = \P ■ v(p)  = c,  pe  (7;  = V~l(c)  n (7. 

and  define  .V/!  to  he  the  union  of  all  q e G with  the  property  that  (i)  I(q)  = R ' , 
and  (ii)  n(R  *,  q)  cr  Vc  for  some  c = c(q). 

Proposition  1.  Let  n be  a system  and  let  V be  a Liapunov  function 
for  n on  G If  n(t.  p)  is  compact  and  n(R\  p)  c G.  then  Q(p)  c M,  n Vc  for 
some  c = c{p). 


Proof  The  assumptions  on  n(t.  p)  and  V imply  that  V(n(t.  p))  is  non- 
increasing and  bounded  from  below  on  R ' . Hence,  V(n(t.  p))  -»  c as  l -»  x, 
and  Q(p)  c:  . It  then  follows  from  T,  4 (and  this  is  the  simple  observation 

behind  the  invariance  principle)  that  q € Ci(p)  implies  l(q)  = R*  and 
n(R  \ q)  c Lc.  Therefore.  ii(p)  c z M v This  completes  the  proof. 

It  is  usually  not  pointed  out  explicitly  that,  in  addition  to  Q(/>)  being  in 
,Vf|.  it  also  lies  on  a V = const  surface.  Although  an  obvious  fact,  it  is  worth 
noting.  For  example,  if  it  is  known  that  .V/,  n Vc  is  either  empty  or  a single 
point  for  each  c.  then  Ml  is  the  set  of  all  equilibrium  points  in  (7  and 
each  compact  solution  that  remains  in  G approaches  an  equilibrium  point 
as  t -*  x.  If  R r is  replaced  by  the  nonnegative  integers,  then  the  result  is 
an  invariance  principle  for  discrete  dynamical  systems  (difference  quations; 
see  [16]). 

We  now  consider  a slightly  stronger  continuity  property  for  n and  see 
what  changes  this  makes  in  the  above  results. 


Definition  4.  n e C'2  if  (i)  n e C{  and  (ii ) tn~*  t e Up)  implies  n(tn.  p)  -* 

n(t.  p). 


It  then  follows  easily  for  a D2-system  n that 
P2  o)(p)  = t + io(n(t.  p)).  t e l(p). 

P22.  tel(p)  n l(q)  and  n(t.  p)  = n(t.  q)  implies  I(p)  = l(q)  and 
n(t  -E  s,  p)  = n(t  + s.  q)  for  all  s > 0 and  r + s e l{p). 


For  a D2-system  n.  we  can  come  closer  to  the  direct  methods  of  Liapunov 
theory.  Let  V be  a real-valued  function  on  P and  define 


V(p)  = lim  inf 

1-0* 


y(Mt.  p))  - V{p) 

t 
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If  V(n(u  p))  is  nonincreasing  on  [0,  /I)  a l(p ),  then 

V(n(t,p))  = D+  V(n(t,  p))  < 0 

on  [0,  fi).  Conversely,  if  n is  a D2-system  and  V is  lower  semicontinuous 
on  P,  then  V(n(t,  p))  < 0 on  [0,  /f)  implies  V(n(t,  p))  is  nonincreasing  on 
l(p).  We  have  therefore. 

Lemma  1.  Let  n be  a D2-system  on  P.  Then  V is  a Liapunov  function 
of  7t  on  G if  and  only  if  V is  continuous  on  (7  and  V(p)<()  for 
all  p 6 G.  If  V is  a Liapunov  function  of  n on  G,  then 

Af,  c M, 

where  M is  the  largest  invariant  set  in  E = \q\  V(q)  = 0,  q e (7)  [that  is,  M 
is  the  union  of  all  n(I(p),  p)  for  which  I(p)  = R'  and  n(R  ' , p)  c £]. 

As  a consequence  of  this  lemma  and  Proposition  1.  we  have 

Proposition  2.  Let  n be  a D2-system  and  let  V be  a Liapunov  function 
of  n on  G.  If  n(t , p)  is  compact  and  remains  in  G for  all  t > 0,  then 

ii(p)  c M n Vc  for  some  c = c(p). 

If  V(n(t,  q))  is  absolutely  continuous  with  respect  to  t for  each  q e £, 
then,  of  course,  M = M v In  any  case  in  the  application  of  this  type  of 
result,  it  is  the  set  M that  can  be  located  by  direct  methods,  and  we  see 
in  this  abstract  setting  the  role  of  continuity  properties  of  n(t,  p)  with  respect 
to  /.  In  applications,  the  space  P will  be  a metric  space  and  the  motions 
n(t , p)  correspond  to  solutions  of  some  sort  of  evolutionary  equation.  For 
ordinary  differential  equations,  the  space  P is  /i-dimensional  Euclidean 
space,  but  more  generally  P is  a Banach  space  and  is  not  locally  compact. 
This  raises  a practical  difficulty.  To  use  Proposition  2 requires  being  able 
to  verify  the  compactness  of  solutions,  and  this  may  not  be  possible  or 
obvious  by  direct  methods.  Getting  around  this  difficulty  is  the  major 
problem  to  be  overcome  in  developing  a practical  stability  theory.  This  is 
well  illustrated  by  the  examples  discussed  in  [14], 

Let  us  note  one  more  difference  between  D,-  and  D2- systems.  Another 
definition  used  tor  the  positive  limit  set  is 

r< p)  = n *<[/».  <>>■  !>)■ 

II  >0 
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It  is  dear  that  BirkhofT’s  Q(p)  is  always  contained  in  T(p),  and  for  Drsystems 
Q(p ) can  be  smaller  than  T (p).  However,  for  D2-systems  we  have 

P2  3.  If  7t  is  a D2-system,  then  Q(p)  = T(p). 


3.  Associating  Dynamical  Systems  with  Nonautonomous 
Flows  (Processes) 

Following  a technique  of  Miller  [27],  Sell  [38]  showed  how  to  associate 
in  a significant  manner  a dynamical  system  with  a nonautonomous  system 
of  ordinary  differential  equations.  (Today  this  associated  dynamical  system 
is  being  called  a “skew-product”  dynamical  system.)  Then  from  the 
invariance  property  of  the  positive  limit  sets  of  the  dynamical  system  (see 
P 1 3 and  P,  4),  Sell  obtained  all  the  invariance  properties  known  at  that 
time  for  the  limit  sets  of  the  solutions  of  nonautonomous  systems  (see,  for 
example,  [19,  24,  25.  27,  33,  49]).  This  is  a very  fruitful  idea.  Dafermos 
obtained  similar  invariance  results  in  his  study  [5]  of  compact  processes. 
In  [8]  Dafermos  relates  his  ideas  to  those  of  Miller  and  Sell.  Here  I shall 
place  these  ideas  of  Miller,  Sell,  and  Dafermos  in  an  abstract  setting,  also 
borrowing  the  terminology  they  have  introduced. 

Definitions.  Let  X be  a Frechet  space.  With  each  (r0,  x)eR  x X 
there  is  associated  an  interval  l{t0,  x)  = [0,  o )(t0,  x)),  0 < <u(t0,  x)  < ac. 

Let 

/ = {(r,  f0,  *);  {t0,  x)  e R x X,  t e I(t0 , x)}. 

A map  u:  / -*  X is  called  a process  if 

(1)  relative  to  u each  l(t0,  x)  is  maximal, 

(2)  w(0,  t0,  x)  = x, 

(3)  s,  t e R*  and  s + t e l(t0 , x)  implies 

.s  e l(t0  + /,  u(t,  t0,  x))  and  u(.s  + t,  t0,  x)  = u(s,  t0  + f,  u(t,  t0,  x)). 

(4)  [some  continuity  conditions  on  l(t0,  x)  and  u]. 

For  a nonautonomous  differential  equation,  x =/(f,  x)  with  </>(f,  t0,  x°) 
the  solution  (for  t > t0 ) satisfying  (p(t0 , r0 , x°)  = x°.  Note  that  u corresponds 
to 


M (f,  t0,  X°)  = (f)(t  + t0,  1 0,  X°). 
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Although  it  is  important  to  consider  specific  continuity  conditions  (4),  I 
wish  here  to  discuss  only  general  ideas  behind  associating  a dynamical 
system  with  a process  and  the  difficulties  behind  doing  this  insofar  as  stability 
theory  is  concerned.  Also,  for  simplicity,  I will  assume  that  I(tQ,  x)  = R" 
for  all  (f0,  x)  e R x X (y  = R * x R x X).  The  changes  necessary  for  a local 
theory  are  not  difficult  to  carry  out. 

Let  W denote  the  set  of  all  mappings  of  R+  x R x X -*  X and  let  U 
be  the  subset  of  all  processes  in  W.  The  translate  u,  of  a process  u is  defined 
by  ur(f,  t0 , x)  = u{t , t0  + r,  .xj. 

Define  or.  R*  x X x V -*  X by 

a(f,  a,  u)  = u(t,  0,  a). 

It  then  follows  from  (3)  that 

a(.s  -f  t,  a,  u)  = »(/,  a(.v,  a,  u),  us)  for  all  s,  t > 0. 

Then  with  p = (a,  u)  e X x U = P define 

7r(f,  a,  u)  = ( ot(t , a,  u ),  u,). 

Clearly  n satisfies  Aj-A3  of  Definition  1 for  a dynamical  system.  The 
remaining  and  difficult  question  (if  n is  to  be  at  least  a Dj-system)  is  the 
continuity  property  A4 . It  is  too  much  to  expect  that  n will  be  a dynamical 
system  on  P = X x U but  we  can  hope  that  n will  be  a dynamical  system 
on  P*  = X x U*,  where  U*  is  a subset  of  U.  Suppose  that  W is  a Frechet 
space  and  that  convergence  in  X and  the  continuity  property  (4)  for 
processes  have  been  fixed.  We  then  want  to  have  the  invariance  property 
of  compact  motions  n(t,  p)  induce  an  invariance  property  for  compact 
motions  a(t,  a,  u)  = u(t,  0,  a). 

This  means  that  U*  must  be  translation  invariant  (u  e U*  implies  u,  e U*) 
and,  in  fact,  that  u e U*  implies  H(u)  = u ^ cr  U * [H(u)  is  called  the  hull 
of  u].  These  are  Sell’s  regular  processes  [38],  Moreover,  for  induced  invariance 
the  motion  u , in  W must  be  compact  in  order  that  compactness  of 
u(t,  0,  a)  in  X correspond  to  compactness  of  n(t.  p)  in  P.  These  are  Dafermos's 
compact  processes  [5,  8]. 

Let  T(a,  u)  = r{p)  c X be  the  limit  set  of  u(t,  0,  a),  and  let  Q(p)  c P be 
the  limit  set  of  7r(f,  p).  Q(p)  = T(/>)  x H^(u),  where  Hx(u)  is  the  limit  set 
of  u,  in  W [Hx(u)  is  called  the  asymptotic  hull  of  u].  We  then  have 

Proposition  3 (Induced  Invariance).  Ifu  is  a compact  regular  process 
and  u(t,  0,  a)  is  compact  in  X , then  T(.v,  u)  is  invariant  in  the  sense  that, 
if  ye  T(a,  u),  there  is  a re  //x(u)  such  that  v(R  + , 0,  y)  c:  T(a,  u). 
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In  the  processes  considered  by  Dafermos,  his  convergence  is  such  that 
every  process  is  regular  but  this  will  not,  in  general,  be  true.  Proposition  3 
is  simply  a statement  of  a general  method  for  obtaining  invariance  properties 
of  nonautonomous  flows  defined  by  processes.  The  processes  themselves 
may  be  defined  by  the  solutions  of  nonautonomous  evolutionary  equations. 
The  difficulty  is  to  find  a “good”  convergence  for  processes.  A “best” 
convergence,  if  one  exists,  maximizes  the  class  of  regular  compact  processes. 
Weakening  the  convergence  may  make  the  class  of  regular  processes  larger 
but  may  decrease  the  class  of  compact  processes. 

In  the  case  of  ordinary  differential  equations  x = J'(t , .x).  Sell  [38]  took  the 
compact  open  topology  for  the  functions  f He  then  showed  that  the  processes 
defined  by  / are  regular  if/satisfies  a local  Lipschitz  condition  with  Lipschitz 
constant  independent  of  t.  However,  then  / is  compact  if  and  only  if  / is 
bounded  and  uniformly  continuous  on  each  R + x K,  K a compact  set  of 
R".  This  covered  all  the  invariance  principles  for  nonautonomous  ordinary 
differential  equations  known  at  that  time.  In  1972,  Peng  [36],  motivated  by 
the  practical  problem  of  controlling  a system  with  unknown  but  bounded 
parameters,  obtained  directly  an  invariance  property  for  his  systems,  which 
is  not  included  in  the  above-mentioned  result  of  Sell.  In  1974,  Wakeman  [44] 
used  a weaker  topology  for  the  functions  / [the  compact  open  topology 
for  F(x,  t)  = Jo  f(s,  x)  J.v]  and  obtained  an  invariance  property  for  the  limit 
sets  of  bounded  solutions  for  a much  wider  class  of  / His  class  of  compact 
regular  / is 

(1)  /satisfies  a Caratheodory  condition,  and 

(2)  for  each  compact  K in  Rn  there  is  an  / such  that 

(a)  |/M)|  < /, 

(b)  |/(f,x)-/(t,  y)\  <l(x-y), 
for  all  (f,  x)  in  R + x K. 

By  the  invariance  principle  this  gives  a stability  theory  analogous  to  that 
for  autonomous  ordinary  differential  equations,  and  this  should  be  a much 
more  powerful  tool  for  the  stability  analysis  of  nonautonomous  systems 
than  the  corresponding  classical  Liapunov  theory,  which  rules  out  many 
natural  Liapunov  functions.  A possible  type  of  application  can  be  found 
in  [11],  where  the  problem  is  to  control  a space  station  in  an  unstable 
periodic  orbit.  One  way  of  doing  this  is  to  determine  feedback  control  to 
stabilize  the  motion  (make  the  orbit  asymptotically  stable).  Fearnsides  and 
Levine  [11],  using  the  same  notation  as  in  Section  2,  selected  instead 
a sufficiently  large  G of  initial  conditions  and  a set  E (a  sufficiently  small 
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set  about  the  periodic  orbit).  Then  using  a Liapunov  function  V they 
determined  controls  so  that  G is  positively  invariant  and  E corresponds 
to  V = 0 in  <j.  This  then  ensures  that  motions  starting  in  G approach  E 
(in  fact,  by  the  invariance  property  for  nonautonomous  systems  and 
Proposition  2 they  could  have  concluded  that  the  solutions  approach  the 
smaller  set  M).  Digital  computer  simulation  showed  that  the  control 
(thrusters)  kept  the  actual  motion  close  to  the  desired  motion  with  a 50% 
reduction  in  fuel  and  the  number  of  thruster  firings  compared  with  making 
the  orbit  itself  asymptotically  stable. 

One  difficulty  in  connection  with  Wakeman’s  results  is  that  he  obtains 
sufficient  conditions  for  asymptotic  stability  (with  the  stability  being  uniform) 
but  not  for  uniform  asymptotic  stability.  In  applications  one  wants  to 
assure  stability  under  perturbations,  and  this  is  the  reason  for  wanting 
uniform  asymptotic  stability.  Of  course,  another  question  here  is,  can 
Wakeman’s  results  be  improved  and  is  there  in  any  sense  a “best”  topology 
(or  convergence)  for  the  functions  /? 


REFERENCES 

[Ij  Bhatia,  N.  P.,  and  C).  Hajek,  "Local  Semi-Dynamical  Systems."  Springer-Verlag.  Berlin. 
1969. 

[2]  BirkhofT,  (i.  D„  “Dynamical  Systems,"  Amer.  Math.  Soc.  Colloq.  Publ.  Vol.  9.  Amer. 
Math.  Soc..  Providence,  Rhode  Island,  1927. 

[3]  Chaffee,  N„  and  Infante,  E.  F„  A bifurcation  problem  for  a nonlinear  partial  differential 
equation  of  parabolic  type.  CDS  Tech  Rep.  74-5,  Brown  Univ.  (March  1974). 

[4]  Cruz,  M.  A.,  and  Hale,  J.  K..  Stability  of  functional  differential  equations  of  neutral 
type,  J.  Differential  Equations  7 (1970),  334  355. 

[5]  Dafermos.  C.  M . An  invariance  principle  for  compact  processes,  J Differential  Equations 
9 (1971),  239  252. 

[6]  Dafermos,  C.  M..  Applications  of  the  invariance  principle  for  compact  processes. 
I.  Asymptotically  dynamical  systems,  J.  Differential  Equations  9 (1971),  291  299; 
II  Asymptotic  behavior  of  solutions  of  a hyperbolic  conservation  law,  ibid.  11  (1972), 
416  424. 

[7]  Dafermos,  C.  M„  Uniform  processes  and  semi-continuous  Liapunov  functionals.  J. 
Differential  Equations  II  (1972),  401  405. 

[X]  Dafermos,  C.  M„  Semitlows  generated  by  compact  and  uniform  processes.  Math.  Systems 
Theory  8 (1975),  142  149. 

[9]  Dafermos,  C.  M.,  and  Slemrod.  M . Asymptotic  behavior  of  nonlinear  contraction 
semigroups,  J.  Functional  Analysis  13  (1973),  97  106. 

[10]  DiPasquanionio,  F„  Stability  in  the  first  approximation  and  a critical  case  relating  to 
nuclear  reactor  kinetics  equations,  Nukleonik  II  (196X).  276  2X2. 

{ II]  Fearnsides.  J.  J.,  and  Levine,  W.  S..  On  the  determination  of  the  asymptotic  behavior 
of  an  inertially  oriented  space  station,  IEEE  Trans.  Automatic  Control  AC-19  (1974), 
1X6  191 


STABILITY  THEORY  AND  INVARIANCE  PRINCIPLES 


221 


[12]  Hale,  J k.  A stability  theorem  for  functional  differential  equations.  Pruc.  Nat.  Acad. 
See  USA  50  (1963).  942  946 

[13]  Hale.  J k.,  Sufficient  conditions  for  stability  and  instability  of  autonomous  functional 
differential  equations,  J Differential  Equations  I (1965),  452  482. 

[14]  Hale.  J k..  Dynamical  systems  and  stability.  J Math.  Anal.  Appl.  26  (1969),  39  59. 

[15]  Hale,  J k.  and  Infante,  E.  F.,  Extended  dynamical  systems  and  stability  theory,  Proc. 
Nat.  Acad  See  U S A 58  (1967).  405  409. 

[ 16]  Hurt.  J..  Some  stability  theorems  for  ordinary  difference  equations.  SI  AM  J Numer.  Anal. 
4 (1967).  582  596. 

[17]  LaSalle.  J P.  The  extent  of  asymptotic  stability.  Proc.  Nat  Acad  Sci.  U S A.  46  (I960). 
363  365 

[ 18]  LaSalle.  J P . Some  extensions  of  Liapunov’s  second  method.  IRE  Trans.  Circuit  Theory 
CT-7  (I960).  520  527. 

[ 1 9 J LaSalle.  J P.  Asymptotic  stability  criteria,  in  Proc.  Symp.  Appl.  Math.,  Hydrodynamic 
Instahilit i 13.  299  307  Amer  Math  Soc.  Providence,  Rhode  Island  (1962). 

[20]  LaSalle.  J P.  Liapunov's  second  method,  stability  problems  of  solutions  of  differential 
equations.  Proc  NATO  Adian  Study  Inst..  Padua.  Italy  pp.  95-106.  Edi/iom 
"Oderisi."  (iuhhio  (1966) 

[21]  LaSalle.  J P.  An  invariance  principle  in  the  theory  of  stability,  in  Differential  Equations 
and  Dynamical  Systems.  Proc.  Int  Symp..  Puerto  Rico  pp.  277  286.  Academic  Press,  New 
York  (1967) 

[22]  LaSalle.  J P Stability  theory  for  ordinary  differential  equations,  J Differential  Equations 

4 (1968).  57  65 

[ 23 ] LaSalle.  J . and  Lelschet/.  S . "Stability  by  Liapunov’s  Direct  Method  with  Applications.” 
Academic  Press,  New  York.  1961 

[24]  Markus.  L . Asymptotically  autonomous  differential  systems,  in  "Contributions  to  the 
Theory  of  Nonlinear  Oscillations.”  Vol  3.  pp  17  29  Princeton  Univ.  Press,  Princeton. 
New  Jersey.  1956 

[25]  Miller.  R k . On  almost  periodic  differential  equations,  Hull.  Amer.  Math.  Soc.  70  (1964), 
792  795 

[26]  Miller,  R k , Asymptotic  behavior  of  nonlinear  delay  differential  equations,  J.  Differential 
Equations  I (1965).  293  305. 

[27]  Miller,  R k . Almost  periodic  differential  equations  as  dynamical  systems  with  applications 
to  the  existence  of  a.  p.  solutions,  J Differential  Equations  1 (1965),  337  345. 

[ 28 J Miller.  R k . Asymptotic  behavior  of  solutions  of  nonlinear  differential  equations,  Trans. 
Amer  Math  Soc  115(1965).  400  416. 

[29]  Miller,  R k . The  topological  dynamics  of  Volterra  integral  equations.  Stud.  Appl.  Math. 

5 ( 1969).  82  87. 

[30]  Miller,  R k . and  Sell.  (J  R.,  A note  on  Volterra  integral  equations  and  topological 
dynamics.  Bull.  Amer.  Math.  Soc  74  (1968).  904  908. 

[31]  Miller,  R k..  and  Sell.  Ci.  R..  Existence,  uniqueness  and  continuity  of  solutions  of 
integral  equations.  Ann.  Mat  80(1968).  135  152. 

[32]  Miller.  R k.,  and  Sell.  (J.  R . Volterra  integral  equations  and  topological  dynamics, 
Mem  Amer.  Math.  Soc.  102  (1970). 

[33]  Opial.  Z.,  Sur  la  dependance  des  solutions  d’un  systeme  d'equations  differentielles  de 
leurs  seconds  membres.  Application  aux  systemes  presque  autonomes,  Ann.  Polon.  Math. 
8 (I960).  75  89. 

[34]  Parks,  P.  C.,  A stability  criterion  for  a panel  Ifuttcr  problem  via  the  second  method  of 
Liapunov,  in  Differential  Equations  and  Dynamical  Systems.  Proc.  Int.  Symp..  Puerto  Rico 
pp.  287  298.  Academic  Press.  New  York.  1967. 


222 


JOSEPH  P.  LASALLE 


[35]  Pazy,  A.,  On  the  applicability  of  Liapunov’s  theorem  in  Hilbert  space.  SIAM  J.  Math. 
Anal.  3 (1972),  291  294. 

[36]  Peng,  T.  K.  L.,  Invariance  and  stability  for  bounded  uncertain  systems,  SIAM  J.  Control 
10  (1972),  679  690. 

[37]  Plaut,  R.  H.,  Asymptotic  stability  and  instability  criteria  for  some  elastic  systems  by 
Liapunov’s  direct  method.  Quart.  Appl.  Math.  (1972),  535  540. 

[38]  Sell,  (i.  R.,  Nonautonomous  differential  equations  and  topological  dynamics,  Trans.  Amer. 
Math.  Soc.  127  (1967),  241  283. 

[39]  Slemrod.  M.,  Asymptotic  behavior  of  a class  of  abstract  dynamical  systems,  J.  Differential 
Equations  7 (1970),  584  600. 

[40]  Slemrod.  M.,  Asymptotic  behavior  of  periodic  dynamical  systems  on  Banach  spaces,  Ann. 
Mat  Pura  Appl.  86  (1970),  325-330. 

[42]  Slemrod,  M.,  and  Infante,  E.  F.,  An  invariance  principle  for  dynamical  systems  on  a 
Banach  space;  application  to  the  general  problem  of  thermoelastic  stability,  in  “Instability 
of  Continuous  Systems,"  pp.  215  221.  Springer-Verlag,  Berlin,  1971. 

[43]  Slemrod.  M..  and  Infante,  E.  F„  Asymptotic  stability  criteria  for  linear  systems  difference- 
differential  equations  of  neutral  type  and  their  discrete  analogues,  J.  Math.  Anal.  Appl. 
38  (1972),  339-415. 

[44]  Wakeman.  D.  R.,  An  application  of  topological  dynamics  to  obtain  a new  invariance 
property  for  nonautonomous  ordinary  differential  equations,  Ph.D.  dissertation.  Brown 
Univ.,  June  1973  J.  Differential  Equations  17  (1975),  259  295. 

[45]  Walker.  J.  A.,  Liapunov  analysis  of  the  generalizer  Pflueger  problem,  ASME  J Appl. 
Mech.  39  (1972),  935  938. 

[46]  Walker,  J.  A.,  Energy-like  Liapunov  functionals  for  linear  elastic  systems  in  a Hilbert 
space.  Quart.  Appl.  Math.  30  (1973),  465  480. 

[47]  Walker.  J.  A.,  On  the  application  of  Liapunov’s  direct  method  to  linear  dynamical 
systems,  J.  Math.  Anal.  Appl.  (to  appear). 

[48]  Walker,  J.  A.,  and  E.  F.  Infante,  Some  results  on  the  precompactness  of  orbits  of 
dynamical  systems.  CDS  Tech.  Rep.  74-2,  Brown  Univ.  (February  1974)  (to  appear  in 
J.  Math.  Anal.  Appl.). 

[49]  Yoshizawa.  T.,  Asymptotic  behavior  of  solutions  of  a system  of  differential  equations, 
Contrih.  Differential  Equations  I (1963),  371-387. 

[50]  Yoshizawa,  T..  "Stability  Theory  by  Liapunov’s  Second  Method,”  Publ.  No.  9,  Math. 
Soc.  of  Japan,  Tokyo,  1966. 


Topological  Dynamics  and  Its  Relation  to  Integral 
Equations  and  Nonautonomous  Systems 


RICHARD  K.  MILLER- 
Mathematics  Department 
Iowa  State  University.  Ames,  Iowa 

GEORGE  R.  SELL' 

School  of  Mathematics 

University  of  Minnesota,  Minneapolis,  Minnesota 


This  paper  is  dedicated  to  the  memory  of  Professor  Solomon  Lefschetz. 

I.  Introduction 

Our  main  objective  in  this  chapter  is  to  investigate  the  role  of  the  theory  of 
topological  dynamics  in  the  study  of  nonautonomous  ordinary  differential 
equations  and  Volterra  integral  equations.  The  application  of  topological 
dynamics  to  autonomous  differential  equations  is  a classical  theory  with 
its  origin  in  the  works  of  Henri  Poincare  and  George  Birkhoff.  However, 
the  applications  to  nonautonomous  processes  and  integral  equations  are 
rather  recent  with  most  of  the  developments  occurring  within  the  last  decade. 

In  this  chapter  we  hope  to  cover  the  main  developments  in  this  area  within 
the  last  five  years.  Other  contributions  to  this  symposium  cover  such 
important  topics  as  differentiable  dynamics,  dynamical  systems  in  a Banach 
space,  and  the  theory  of  the  dynamics  of  functional  differential  equations. 
So  we  shall  not  touch  on  these  related  topics  here.  For  references  to  other 
results  prior  to  1970,  see  [36,  62,  64,  66]. 

The  main  problem  in  studying  nonautonomous  systems  of  ordinary 
differential  equations  is  the  embedding  of  the  time  parameter  / in  a suitable 
space.  The  well-known  artifice  of  replacing  the  nth-order  nonautonomous 
system  x'  =f(t,  x)  with  an  ( n -I-  l)st-order  autonomous  system  by  setting 
x„  + , = t,  or  x'n+  , = 1,  is  of  little  use  in  studying  stability  questions  or  other 
dynamical  properties.  For  periodic  equations  the  circle  S'  is  a natural  space 
for  embedding  time.  However,  for  aperiodic  equations  one  must  seek  a 
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different  space  for  the  embedding.  The  coefficient /contains  all  of  the  essential 
information  about  time  dependence.  Moreover,  time  translation  of  / 
determines  a How  that  is  compatible  with  translation  of  solutions.  Thus 
time  translates  of  / determine  a natural  embedding  space.  This  observation 
is  certainly  simple  but  has  profound  consequences.  We  shall  examine  this  in 
more  detail  in  Sections  II  and  III. 

In  Section  IV  we  shall  look  at  the  corresponding  dynamical  structure  of 
integral  equations.  Section  V will  be  devoted  to  the  study  of  the  asymptotic- 
properties  of  solutions  and  the  limiting  equations.  The  LaSalle  invariance 
principle  is  very  useful  in  this  context. 

Section  VI  will  be  devoted  to  an  analysis  of  the  theories  of  the  existence 
of  periodic  and  almost  periodic  solutions.  In  Section  VII  we  will  study  the 
role  of  generic  theories  in  the  qualitative  study  of  ordinary  differential 
equations. 

The  last  three  sections  are  concerned  primarily  with  linear  equations. 
Section  VIII  covers  the  theory  of  exponential  dichotomies  and  invariant 
splittings,  and  Section  IX  deals  with  the  structure  of  the  Liapunov-type 
numbers  and  possible  extensions  of  the  Floquet  theory  to  aperiodic  linear 
differential  equations.  Finally,  in  Section  X we  shall  present  two  points  of 
view  that  illustrate  how  the  solutions  of  linear  Volterra  integrodifferential 
equations  can  be  used  to  construct  one-parameter  semigroups  on  a suitable 
Banach  space. 


II.  Local  Dynamical  Systems 

Our  starting  point  will  be  the  notion  of  a local  dynamical  system  on  a 
Hausdorff  space  X. 

Definition  1.  Given  any  point  p in  X , let  l(p)  — (oip,  <op)  be  a given 
interval  with  rp  < 0 < wp . Let 

S = {(r,  p)e  R x X :te  /(/>)}. 

A function  n\  S -*  X is  a local  dynamical  system  or  flow  on  X if  the  following 
hold : 

(i)  7t(0,  p)  = p for  all  p in  X. 

(ii)  If  /e  l(p)  and  se  l(n(t,  p))  then  / + ,s  e l(p)  and  7r(.s,  7t(f,  p))  = 
n(t  + s,  p). 
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(iii)  Each  l(p)  is  maximal  in  the  sense  that  either  ojp  = + x or  the 

closure  of  the  semitrajectory  £ 

7+{p)  = Ml  p) : 0 < t < ojp } 

is  not  compact:  and  similarly  either  yp  — — x or  the  closure  of  the  semi- 
trajectory 

V ( P ) = Ml  P)'-xp<  t < 0} 

is  not  compact. 

(iv)  n:  S -»  X is  continuous.  Moreover,  if  pn->  p in  X,  then  lim  inf  l(pn) 
<=  l(p)  (n-*oc).  (We  shall  occasionally  use  the  notions  of  generalized 
sequences  or  nets  to  describe  various  topological  properties  of  dynamical 
systems;  cf.  [63]  for  more  details.) 

This  definition  of  a local  dynamical  system  reduces  to  the  usual  definition 
of  a dynamical  system  (or  flow)  whenever  I(p)  = ( - x,  x)  for  all  p in  X.  In 
this  case  (iii)  and  the  last  part  of  (iv)  are  automatically  true.  If  we  restrict 
the  intervals  l(p)  to  have  the  form  l(p)  = [0,  o>p),  then  with  the  definition 
of  S and  with  (iii)  appropriately  modified,  n defines  a local  semiflow  on  X. 
If  l(p)  is  always  [0.  x),  7r  is  a semiflow.  For  more  details  see  [31.  36,  62.  63]. 

We  shall  note  that  the  definition  of  a local  dynamical  system  n:  S ->  X 
implies  that  the  domain  S is  open  in  R x X.  In  fact,  an  equivalent  definition 
can  he  formulated  in  terms  of  an  open  domain  for  S (cf.  [69],  for  example). 
Also  we  should  note  that  the  use  of  the  term  “local”  in  the  definition  of  a 
local  dynamical  system  is  a miscasting.  The  mapping  n is  not  defined  only 
locally.  One  can  show  that  the  domain  S is  maximal  in  the  sense  that  n 
has  no  proper  extension  in  K x X to  a mapping  that  is  also  a local 
dynamical  system. 

We  shall  refer  to  the  function  of  /,  7t(l  p)  as  the  motion  through  p. 
We  shall  call  the  point  set 

v(p)  = Ml  p)  ■ t g i(p)} 

the  trajectory  through  p and  ll(p)  = Cl  y (p)  the  hull  of  p (where  Cl  denotes 
the  closure  operation).  Also  //’(/>)  = Cl  y + (p)  and  //  (p)  = Cl  y (p)  will 
denote  the  positive  and  negative  hulls  of  p.  The  y-  and  (o-limit  sets  are  then 
defined  by 

Mp)=  f //  (n(t,p)).  Q(p)  = f)  l/  + (n(t,  p)). 

I f K »€  R 

A set  M S X is  invariant  if  for  all  pe  M one  has  l(p)  = ( — x,  x)  and 
y(p)  £ M.  A set  XI  ^ X is  said  to  be  a minimal  set  if  M is  nonempty. 
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closed,  and  invariant,  and  contains  no  proper  subset  with  these  three  ; 

properties.  .V/  is  said  to  be  an  a.p.  minimal  set  if  .V/  is  minimal  and  the  . 

motions  in  .V/  are  almost  periodic  (cf.  [63]  for  more  details).  A motion  < 

7t{t.  p)  is  said  to  be  positively  compact  if  y * (p)  is  precompact,  i.e..  if  //*(p)  J 

is  compact.  The  following  lemmas  are  easily  established. 

Lemma  2.  For  each  pe  X with  ojp  = + x,  the  limit  set  Q(p)  is  closed 
and  invariant.  If  the  motion  n(t.  p)  is  positively  compact,  then  ojp  = + x 
and  Q(p)  is  nonempty,  compact,  and  connected.  | 

Lemma  3.  If  n(t.  p)  is  positively  compact  and  if  nit.  p)  approaches  a set 
E as  t — + x.  then  7r(r.  p)  -*  M as  t -»  + x.  where  M is  the  largest  invariant  ' 

subset  of  Cl  E. 

I 

Let  G S X and  let  V : Cl  G — R be  a continuous  mapping.  Following 
LaSalle  [14]  we  say  that  V is  a Liapunov  Junction  on  G (with  respect  to  a j 

local  dynamical  system  n)  if  Vln(t.  p))  is  nonincreasing  in  t whenever 
■ *(p)c:  G.  The  following  result  is  an  abstract  formulation  of  the  LaSalle 
invariance  principle:  j 

I 

Theorem  4.  Let  V be  a Liapunov  function  on  G with  respect  to  n.  i 

Let  n(t.  p)  be  a positively  compact  motion  and  assume  that  y *(p)  £ G.  Then  | 

n(t,  p)  -»  M as  t -*  x.  where  M is  the  union  of  all  trajectories  y (q)  with  the 
property  that  l{q)={  — x,  x).  y (q)  a G.  and  V {n{t.  q))  = V (q)  for  all 
t e R. 

This  theorem  together  with  the  construction  described  in  the  next  section 
can  be  combined  to  obtain  results  about  the  limiting  behavior  of  solutions 
of  nonautonomous  differential  equations,  as  in  the  chapter  by  LaSalle. 

There  do  exist  many  variations  of  the  concept  of  a flow.  These  occur 
when  the  real  line  R is  replaced  by  a topological  group  T.  In  this  case,  the 
flow  is  a continuous  mapping  n:  T x X -*  X satisfying  7r(0,  x)  = x and 
n(s,  n(t , x))  = n{s  + r,  x).  Applications  where  T = Z.  the  integers,  are  quite 
common.  A similar  extension  of  a semiflow  is  possible  where  T is  now  a 
topological  semigroup. 

One  very  interesting  application  of  this  more  general  viewpoint  appears 
in  Dafermos  [5,  6],  where  he  studies  the  partial  differential  equation 


G +./  (L')x  — 
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Without  going  into  detail,  let  us  note  that  Dafermos  introduces  the  concept 
of  a process , which  can  be  viewed  as  a semiflow 

n:  R x X x R*  ->  R x X, 

where  R and  R ‘ = [0,  cc)  have  the  discrete  topology  and  X is  a complete 
metric  space. 


III.  Nonautonomous  Ordinary  Differential  Equations 

Consider  the  family  of  nonautonomous  differential  equations 

x'=f(t,x),  (DEI) 

where  / belongs  to  a function  space  ft  of  functions  g:  R x W -*  Rn  (where 
W is  an  open  set  in  R").  We  shall  assume  that  ft  is  a Hausdorff  space  that 

(i)  is  translation  invariant  (i.e.,  if /e  ft,  then  fx  e 5 for  all  re/?,  where 
JT(t,  x)  =/( t + t , x)),  and 

(ii)  satisfies  the  Caratheodory  property  (i.e.,  for  each  / e ft  and  x0  e W 
there  is  a unique  noncontinuable  solution  cp(t,  x0,/)  of 

x'  = /(t,  x),  x(0)  = x0 , 

defined  on  an  interval  /(x0,/)=  (a(x0,/),  w(x0,/)). 

The  following  theorem,  which  is  basic  for  our  theory,  has  been  discussed 
in  a number  of  papers  (cf.  [36,  40,  62,  63]). 

Theorem  5.  Let  ft  be  translation  invariant  and  satisfy  the  Caratheodory 
property.  Then  the  mapping 

n(t,  x,/)  = (<p(t,  x, /),/,)  (1) 

defined  on  {(f,  x,  /) : t e l(x,  /)}  defines  a local  dynamical  system  on 
W x ft  if  and  only  if 

(iii)  the  mapping  (t,/)  ->/t  is  continuous,  and 

(iv)  7t  is  properly  continuous,  i.e.,  the  mapping  (f,  x,  J )-*(p{t , x, /)  is 
continuous  and  /(x,  /)c  |im  inf  /(x„,  /„)  whenever  (x„,  /„)-+(x,  /)  in 
IT  x ft. 

Remarks.  Several  extensions  of  Theorem  5 are  possible.  First  one  does 
not  need  W to  be  open  in  Rn  but  rather  only  that  the  solution  of  the  initial 
value  problem  x'  = f(t,  x),  x(0)  = x0,  remains  in  W.  For  example,  W could 
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be  a manifold  (cf.  [26]).  Also,  it  is  possible  to  eliminate  the  uniqueness  of 
solutions  of  (DEI),  but  in  this  case  the  formulation  of  the  How  n is 
somewhat  different  than  that  given  by  (1)  (cf.  [66]  for  details).  The  above 
formulation  of  the  flow  n is  an  example  of  a skew-product  How  (cf. 
[53.  57]). 

Since  5 is  translation  invariant,  the  continuity  property  (lii)  means  that 
the  mapping  o(x ,/  ) = J.  defines  a How  on  A.  Therefore,  if  / 6 A.  the  trajectory 
■/(  / ).  hull  //(  / ).  and  o>- limit  set  Q(j  ) are  as  defined  in  Section  If.  It  should  be 
emphasized  here  that  the  sets  H{J  ) and  Q( J ) do  depend  on  the  space  if.  If 
one  changes  A (makes  it  larger  or  smaller),  then  the  sets  H(j  ) and  Q(J  ) may 
change,  too.  The  trajectory  y(  J ) is.  of  course,  determined  by  / alone  and  not 
the  space  5. 

In  order  to  apply  Theorem  5.  and  the  corresponding  theory  of  topological 
dynamics,  one  must  determine  a function  space  if  satisfying  conditions 
(i)  (iv).  Often  one  is  given  a specific  equation 

x'  = f(t , x). 

In  this  case  the  trajectory  y(j ) = { J, : t e R\  must  lie  in  5 since  if  is  to  be 
translation  invariant.  The  Caratheodory  property  (ii)  is,  of  course,  the  con- 
clusion of  the  basic  existence  and  uniqueness  theorems.  Next  the  topology 
on  if  must  be  chosen  so  that  (in)  and  (iv)  hold.  In  practice  the  continuity 
of  translations  (hi)  is  not  very  restrictive.  Property  (iv)  is  the  hardest  to 
satisfy,  but  it  too  is  a standard  problem  arising  in  the  theory  of  ordinary 
differential  equations.  Some  recent  research  ot  Z.  Artstein  and  D.  R. 
Wakeman  has  given  new  insight  into  continuity  properly  (iv). 

For  i ^ ().  let  I,  denote  the  closed  interval  in  R with  endpoints  r and  0. 
and  let  C,  = C(l,,  R")  be  the  Banach  space  of  continuous  functions  from 
I,  to  Rn  with  sup-norm.  The  following  result  is  basically  due  to 
Artstein  [2]: 


Theorem  6.  Let  if  be  translation  invariant  and  satisfy  the  Caratheodory 
property.  Then  continuity  property  (iv)  is  satisfied  if  for  every  I ^ 0 the 
mapping 

G{(pJ)=\  f{s,<p(s))ds 
• o 

is  a continuous  mapping  from  C,  x o to  Rn. 
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For  example,  if  is  a translation  invariant  subset  of  C,[R  x W.  Rn), 
the  space  of  continuous  functions  from  R x W to  Rn  that  are  locally 
Lipschit/  in  x.  and  CL(R  x W.  R)  has  the  topology  of  uniform  convergence 
on  compact  subsets  of  R x W,  then  with  the  help  of  Theorems  5 and  6 it 
is  easy  to  see  that  (1)  defines  a local  dynamical  system  on  W x A.  This 
is  the  first  example  studied  by  Miller  [28]  and  Sell  [62]. 

Wakeman  [70]  studied  a more  general  class  of  problems.  His  generalization 
was  needed  in  order  to  study  certain  problems  in  control  theory.  He 
considered  5 the  collection  of  all  functions  /:  R x W -»  Rn  such  that  the 
following  hold : 

(a)  f(t,  .x)  is  continuous  in  x for  each  i and  measurable  in  t for  each  x. 

(b)  Given  any  compact  set  K £ W there  exist  V/  > 0 and  a locally 
/.'-function  m(t)  such  that 

| f{t,  x)|  < M,  | / (f.  .x ) -/(/,  v ) | < m(t)|.x  - y |,  | m(s)  ds  < M. 

* t 

for  all  f 6 R and  x,  ye  K.  The  topology  on  5 is  defined  by  saying  that  a 
generalized  sequence  \f„\  converges  to  / (/„-»/)  if  for  every  T >0  and 
every  compact  set  A'  £ J-F  one  has 

SUP  ! I {/«(■'■.  *)  -f{s.  -x)|  ds  : - T < t < T.  xe  k\  -0.  (3) 

I1  • o I 

Under  these  conditions,  Wakeman  showed  that  ( 1 ) defines  a local  dynamical 
system  on  W x 5. 

The  construction  of  the  How  n described  in  (1)  does  include  the  standard 
theory  for  autonomous  and  periodic  differential  equations  as  special  cases. 
If  ,x'  = f(x)  is  an  autonomous  equation,  then  one  could  choose  to  be  the 
set  {/},  containing  the  single  function  f.  In  the  periodic  case  x'  =/(r,  ,x). 
where  J is  '/  -periodic  in  t.  the  trajectory  -/(/)  is  homeomorphic  to  a circle. 
Also  the  corresponding  flow  7r  on  W x •/(/)  is  equivalent  to  the  (n  + 1 )st- 
order  autonomous  system 

.x0' = 1,  x'=f(x0,x), 

where  we  identify  .x0  = .x0  -I-  T (modulo  T). 

Another  important  class  of  differential  equations  that  we  shall  return  to 
shortly  are  the  almost  periodic  equations  (cf.  [9,  28,  53,  63]). 

Definition  7.  A continuous  function  /:  R x W -*  Rn  is  almost  periodic 
in  t (uniformly  for  .x  in  compact  subsets  of  W)  if  given  any  real  sequence 
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{f„'J  there  is  a subsequence  ]rn!  and  a function  y such  that  J (t  + tn.  x)  -* 
<y(i,  x)  as  n -»  x uniformly  in  (/,  x)  for  t e R and  x in  compact  subsets  of 
W. 

It  is  known  that  (r. /)->/,  defines  a flow  on  the  family  of  all  almost 
periodic  functions  and  further  that  the  hull  H(J')  is  a compact  minimal 
set;  in  fact,  H(J  ) is  a compact  abelian  group  of  functions  (cf.  [63]).  In  the 
almost  periodic  case  one  often  chooses  ft  to  be  the  hull  //(/)  of  an  almost 
periodic  function.  Since  ft  is  then  compact,  it  happens  that  the  phase  space 
W x <V  for  the  flow  n is  locally  compact,  a simple  mathematical  property 
that  pays  countless  dividends. 


IV.  Integral  Equations  and  Semiflows 

A set  of  Volterra  integral  equations  can  be  used  to  construct  a local 
semitlow.  The  construction  is  similar  to  construction  (1)  for  (DEI).  Let 
</>(f.  /.  y)  be  a solution  of 

/ 

x(r)  =J  (t)  + | y (t,  .v,  x{s))ds  (IE  1 ) 

• o 

on  the  maximal  interval  [0,  a(  /,  y)).  Then  formally 

x(uf,  ty)  = |./,(  ) +|  <y (/  + •.  s,  <p(s,J\  </))  ds.  c/,|  (4) 

on  0 < t < a (f  y).  where  y,(u.  s,  x)  = y(t  + u,  s + t.  x).  Eor  appropriate 
spaces  of  pairs  (/,  y),  this  will  define  a local  semiflow  (see  [31.  36]  for 
details).  Moreover,  if  J is  continuous,  then  results  of  Artstein  [2]  give 
necessary  and  sufficient  conditions  for  proper  continuity  of  n. 

Similarly,  if  <p(t)  = ^(r.  x0 , f y)  solves 

, I 

x'(0  =f(L  x(f ))  + | y{t,  s,  x(.s))  ds,  x(0)  = x0.  (IE2) 

• o 

then  under  appropriate  hypotheses  n defined  by 

n(t,  x0j:  y)  = (</>(f),  /,(  ) + | y{t  + •,  .v,  <^(s))  ds,  ty, | (5) 

determines  a local  semiflow  on  triples  (x0 , f,  y).  We  shall  discuss  their 
last  example  further  in  Section  X. 
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V.  Stability  and  Other  Asymptotic  Properties 

Given  a How  n on  W x as  described  above,  we  now  seek  information 
about  the  asymptotic  behavior  of  solutions  as  i-*+x.  The  LaSalle 
invariance  principle  (Theorem  4)  does  give  some  information  by  using  the 
theory  of  Liapunov  functions.  If  one  cannot  find  a suitable  Liapunov 
function,  one  can  still  get  some  information  about  the  solutions  by  using 
the  theory  of  topological  dynamics.  The  most  important  concept  used  in  this 
analysis  is  the  o)-limit  set.  Specifically  one  is  interested  in  determining  the 
dynamical  structure  of  the  co-limit  set. 

In  order  to  show  that  the  co-limit  set  f2(x,  / ) is  nonempty,  we  will  use 
Lemma  2 and  determine  conditions  under  which  the  motion  n(t.  x,  J)  = 
(<p(t.  x, /)./,)  is  positively  compact.  Now  n(i.  x,  f)  is  positively  compact 
when 

(i)  (p(t . v./)  remains  in  a compact  subset  of  W for  all  t > 0.  [If  W = Rn. 
then  this  is  equivalent  to  demanding  that  </?(/,  x . / ) be  bounded  on 
t > ().] 

(ii)  //  '(J  ) is  compact  in  tf. 

In  this  case  il ( / ) will  be  nonempty,  as  well  as  compact.  The  differential 
equations 

x’  = g(t,x),  (DE2) 

are  very  important.  They  are  called  the  limiting  equations  of  / (cf.  [62]). 

The  condition  that  "//*(/)  be  compact  in  5”  naturally  depends  on 
the  choice  of  iv  and  the  topology  on  dr.  For  example,  if  / e C = 
C(R  x W,  R")  (where  C has  the  topology  of  uniform  convergence  on 
compact  sets)  then  //’(/)  is  compact  if  and  only  if  / (/.  x)  is  bounded 
and  uniformly  continuous  on  every  set  of  the  form  [0.  x)  x K.  where  K 
is  compact  in  W (cf.  [62]).  [The  uniqueness  condition  in  the  Caratheodory 
property  would  impose  additional  criteria  on  H*(f).  These  criteria  can  be 
dropped  entirely  if  one  uses  the  point  of  view  of  [66].] 

Using  the  space  described  in  (3),  Wakeman  [70]  showed  that  //*(/)  is 
compact  if  for  each  compact  set  K c W,  there  exists  M > 0 such  that 

[/(/.  x)  - f(t,  y)|  < M\x  — y|  for  all  t > 0 and  all  x,  ye  K. 

It  is  useful  to  recall  that  periodic  motions  and  even  almost  periodic 
motions  J,  are  compact.  Moreover,  if  /,  is  positively  compact  and  if  /i,  — ► 0 
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in  as  t -»  + x,  then  (/+  /i),  = /,  + /i,  is  positively  compact,  and  / and 
/ + h have  the  same  limiting  equations.  Thus  we  see  that  the  semitrajectory 
V * ( / ) can  be  precompact  in  a variety  of  spaces  5.  When  5 itself  is 
compact,  the  LaSalle  invariance  principle,  together  with  the  flow  n given 
bv  (1).  can  be  employed  to  good  advantage.  See  [13,  15,  17.  27.  51.  70.  73] 
for  examples. 

In  addition  to  the  LaSalle  invariance  principle.  Lemma  3 can  sometimes 
be  used  to  obtain  asymptotic  information.  It  is  not  difficult  to  show  that 
boundedness  and  certain  types  of  stability  of  bounded  solutions  (e.g..  uniform 
stability  and  uniform  asymptotic  stability  ) transfer  from  solutions  of 

x'=J(t,x)  (DEI) 

to  solutions  of  the  limiting  equations  (DE2)  for  J (cf.  [62.  63]).  The 
inverse  problem  of  transferring  information  about  solutions  of  the  limiting 
equations  (DE2)  to  solutions  of  (DEI)  is  of  even  more  interest.  This 
reverse  problem  is  called  the  inverse  limit  problem  [64],  Roughly  speaking, 
the  limiting  equations  are  devoid  of  "transients”  and  presumably  should  be 
simpler  than  (DEI ).  Thus  we  wish  to  transfer  information  from  the  simpler 
limiting  equation  to  some  solution  of  (DEI).  While  this  inverse  limit 
problem  has  not  been  thoroughly  studied  (cf.  [64])  the  following  result  of 
Markus  [25]  is  a good  example  of  a theorem  one  might  expect  in  this  area: 

Theorem  8.  Let  /(/,  \)  = Ax  + </(.y)  + h(t.  x)  be  in  C(R  x Rn,  Rn)  with 
h,-*0  as  i-*+x,  |c/(.y)|  = o(|.y|)  as  | .y | — ► 0,  and  A is  a stable  matrix. 
Then  for  all  initial  conditions  .y,  with  |.\  | sufficiently  small,  the  solution 
c p(t.  x ,/)  — » 0 as  t — ♦ + x . 

For  related  concepts  of  stability  as  applied  to  dynamical  systems  and 
differential  processes,  see  [10  12.  71], 

Similar  asymptotic  analysis  questions  can  be  asked  for  integral  equations. 
Lemma  2,  Lemma  3.  and  Theorem  4 can  be  extended  to  semiflows.  These 
extensions  can  be  used  when  n is  determined  by  (4)  or  (5).  Compactness 
of  an  orbit  (4)  depends  on  compactness  of  the  corresponding  orbits  f, 
and  if,  and  the  boundedness  and  uniform  continuity  of  </)(/.  /.  p)  on 
[0.  / ) (cf.  [36]).  Similar  remarks  apply  to  (5). 

As  an  example,  consider 

, f 

•Y  (t)  + | h(x(s))a(t  - s)  <ls  = k(t).  y(0)  = y0.  (IE3) 

• o 
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where  k(  + x)  exists,  ae  L'( 0,  x),  and  both  h and  k aje  continuous.  In 
this  case.  <y(/.  s,  x)  = — h(x)a(t  — x)  and  / (/.  x)  = k(l)  in  (5).  Thus  cy,  = ty 
for  all  f > 0,  /,  - ► k( -t-  x ),  and  so  yyf  and  /,  are  positively  compact.  II  ,v(f) 
is  hounded,  then  the  assumptions  easily  imply  its  uniform  continuity  on 
[0.  x ).  The  limiting  equation  corresponding  to  (IE3)  can  be  written  in  the 
form 

. r 

y '(f)  + | h(x(s))a(t  — s)  ds  = k(  + ac)  ( — x < / < x).  (IE4) 

’ — x 

Analysis  via  limiting  integral  equations  has  been  used  in  [36.  Part  III] 
for  general  equations,  in  [16.  29.  30.  33.  72]  for  asymptotically  autonomous 
equations,  and  in  [21.  32]  for  periodic  and  almost  periodic  limiting 
equations.  The  most  systematic  exploitation  of  limiting  equations  occurs 
m the  work  of  Levin  and  Shea  [ IS  20],  They  always  assume  the  existence 
of  a bounded  and  uniformly  continuous  solution  of  the  integral  equation. 
Their  conclusions  are  usually  stated  in  terms  of  T-sequences.  The  connection 
between  T-sequences  and  topological  dynamics  is  studied  in  [65]. 

Levin  and  Shea  consider  equations  of  the  form 

A"(r)+  f'  h(X(t-s))dA(s)  = F(t).  (1E5) 

* — X 

X{t)+  I h(X(t-s))clA(s)  = F(l).  (1E6) 

* “ X 

h(X(t-s))clA(s)  = F(t),  (IE7) 

* _ X 

on  — x < t < x.  where  A e BV(  — x,  x ).  A(  — x)  = 0,  f'eL'(-x,  x ). 
and  / ( + / ) exists.  They  study  both  linear  and  nonlinear  cases.  Note  that 
(IE3)  and  (IE4)  (for  example)  are  special  cases  of  (IE5).  We  need  only  put 
/l(s)  = 0 if  s < 0,  ,T(s)  = a( s)  on  \ > 0.  define  F(t)  in  the  obvious  way  in 
(IE3)  or  (IE4),  and  put  x(t ) = 0 for  t < 0 in  (IE3).  A typical  result  is  the 
following.  Let  * denote  the  Fourier  transform  and  let  .S’a(A)  = {.s:  A*(s)  = 
— is,  — y < s < x }. 

Theorem  9.  (See  [ 19]).  With  A and  /•  as  above  and  h(x)  = x,  suppose 
(IE5)  has  a bounded,  uniformly  continuous  solution  A (f).  Then  there  exists 
a function  r/( / ) w ith  //( x ) = 0 and 


234 


RICHARD  K.  MILLER  AND  GEORGE  R.  SELL 


such  that: 

(i)  ifSa(A)  = 0,  then  A"(/)  = F(oo)//1(og)  + rj(t),  on  — oc  < t < oc,  and 

(ii)  if  Sa(A)  = {/,,  A kn\  and  kk  # 0 for  all  k,  then 

A'(r)  = F(oo)/4(oo)  + £ C*(r)exp  (akt)  + t/(r)  (6) 

*=  i 

on  — oo  < r < oo,  where  CkeCJ  n//(— oo,  oc)  and  oo)  = 0 for 

\ < k < n and  / = 1,  2,  3,  — 

(iii)  If  Sa(/1)  = {/i,  /2’  • ••»  AnJ  and  some  is  zero,  then  (6)  remains 
true  but  F(oc)  must  be  zero. 

Similar  results  can  be  proved  for  (IE6)  and  (IE7).  Results  can  also  be 
proved  in  nonlinear  cases.  See  also  [65]. 


VI.  Periodic  and  Almost  Periodic  Solutions 

Some  of  the  first  results  that  were  discovered  by  exploiting  the  topological 
dynamical  viewpoint  for  nonautonomous  differential  equations  concern  the 
existence  of  periodic  and  almost  periodic  solutions.  The  basic  idea  here  is 
that  one  can  describe  the  structure  of  an  co-limit  set,  usually  in  terms  of 
certain  stability  concepts.  The  early  results  in  this  direction  can  be  found  in 
[7,  28,  61].  A different  class  of  existence  theorems  for  almost  periodic 
solutions  was  started  by  Amerio  [1]  and  later  extended  by  Fink  [8]  and 
Seifert  [59],  These  theorems  are  based  on  various  separatedness  concepts. 

Recently  Sacker  and  Sell  [53,  57],  using  the  dynamical  concept  of  a 
distal  flow,  have  presented  a theory  in  the  context  of  skew-product  Hows 
that  combines  both  of  these  approaches  as  special  cases  of  the  following 

Theorem  10.  Let  n be  the  flow  described  by  (1)  on  W x ft,  where  5 is 
the  hull  of  an  almost  periodic  function/:  R x W ->  R".  Let  M be  a compact 
invariant  set  in  W x ft.  Let  p\  W x ft-*  ft  be  the  projection  mapping. 
Assume  that 

(i)  Card  (p  '(g)  n M)  = N < cc  for  some  y e 5,  and 

(ii)  n has  the  distal  property  on  M. 

Then  M is  an  N- fold  covering  space  of  ft,  and  M can  be  written  as  the 
finite  union  of  almost  periodic  minimal  sets.  In  particular,  for  every  (v,  / ) e M 
the  solution  (p(i , x,f)  is  an  almost  periodic  function  of  t. 
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In  applications  the  set  M is  often  chosen  to  be  the  a»-limit  set 
«(Wo  ) of  a positively  compact  motion  n(t,  x0,f0).  The  distal  property 
on  M means  that  if  (x,,  /),  ( x2,/)eM  and  x,  ^ x2  then  there  is  an 
i > 0 such  that  | <p(r,  xt,  /)  — <p(f,  x2,  /)|  > a for  all  t e R.  This  distal 
property  and  the  finiteness  property  (i)  are  weaker  than  the  uniform 
stability  properties  of  [7,  28,  61]  and  the  separatedness  properties  of 
[1,  8,  59],  For  example,  the  following  result  of  Miller  [28]  can  be  derived 
as  a corollary  of  Theorem  10.  Let  3 be  the  hull  of  an  almost  periodic 
function,  where  3 satisfies  conditions  (i)  (iv)  in  Section  III.  If  there  exists 
a bounded  uniformly  asymptotically  stable  solution  of  x'  = /(f,  x)  for  some 
/ e 3,  then  there  exists  an  almost  periodic  solution  for  every  equation  in 
3.  (See  [57]  for  details.) 


VII.  Generic  Theory 

The  point  of  view  underlying  the  “generic  theories”  of  differential 
equations  is  a natural  development  in  our  study  of  the  qualitative  behavior 
of  differential  equations.  Prior  to  the  commencement  of  the  study  of  the 
qualitative  behavior  of  ordinary  differential  equations,  which  began  with 
Poincare  nearly  one  hundred  years  ago,  the  emphasis  in  differential  equations 
had  been  in  the  area  of  quadratures,  which  involves  either  the  explicit  solution 
of  differential  equations  or  expressing  such  solutions  in  terms  of  infinite  series. 
If  one  recalls  the  application  of  the  Poincare- Bendixson  theory  to  the  Van 
der  Pol  equation 

u"  + t:(u2  - 1 )w'  + m = 0, 

one  gets  an  existence  theorem  for  a nontrivial  periodic  solution,  but  not 
an  explicit  or  even  an  implicit  formula  for  this  solution.  As  is  well  known, 
the  Poincare  Bendixson  theory  does  exploit  the  special  topological  structure 
of  the  plane  as  expressed  in  the  Jordan  curve  theorem.  As  one  passes  to 
differential  equations  defined  on  higher-dimensional  spaces,  one  is  often 
forced  in  the  qualitative  theory  to  accept  weaker  conclusions.  One  such  class 
of  conclusions  is  described  in  what  is  called  the  “generic  theories.” 

The  basic  tenet  of  generic  theories  is  to  prove  theorems  of  the  following 
type: 

A “typical”  solution  (function)  has  property  “P” 
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Presumably  in  trying  to  establish  a generic  theorem  as  stated  above,  one 
is  given  a system  (or  a collection  of  solution  functions)  and  one  is  investigating 
a certain  property  P.  It  remains  then  to  decide  on  the  meaning  of  the 
word  “typical.”  Experience  has  shown  that  two  concepts  are  commonly 
accepted,  one  measure-theoretic  and  the  other  metric-theoretic,  viz.: 

(i)  Let  (.V,  HI.  //)  be  a measure  space.  A measurable  subset  A c x is 
“typical”  if  the  complement  X - A has  measure  zero. 

(ii)  Let  .V  be  a complete  metric  space.  A subset  A c X is  "typical”  if  it 
is  residual,  i.e..  A is  large  in  the  sense  of  Baire — or  A is  the  countable 
intersection  of  open  dense  sets. 

The  Fubini  theorem  for  measures  on  product  spaces  would  be  an  example 
of  the  generic  theorems  of  a measure-theoretic  nature,  and  Sard's  theorem 
would  be  an  example  of  the  generic  theorems  of  a metric-theoretic  nature. 
In  the  theory  of  topological  dynamics  one  finds  several  generic  theorems. 
One  is  the  Poincare  Caratheodory  recurrence  theorem: 

Theorem  11.  Let  n be  a flow  on  a compact  space  X and  let  p be  a 
positive  invariant  measure  on  X,  i.e.. 

H(n(t.  ,-1))  = //(/I) 

for  all  Borel  sets  A c x and  all  t e R Then  the  set  P of  Poisson  stable 
points  in  X is  typical  in  X,  i.e..  /i(.Y  — P)  — 0. 

Recall  that  a point  />  e X is  Poisson  stable  if  for  every  neighborhood  U 
of  p the  sets 

{f : t > 0 and  rc(/.  p ) e V),  {t  : t < 0 and  n(t,  p)  e U}, 

are  unbounded  in  R (cf.  [50]).  The  Poincare  Caratheodory  recurrence 
theorem  also  has  a metric-theoretic  counterpart  (cf.  [52]).  Another  generic- 
theorem  is  the  following  ergodic  theorem  of  Birkhofl.  (We  shall  use  the 
fact  that  every  a.p.  minimal  set  is  the  space  of  a compact  abelian  group 
and  therefore  supports  a Haar  measure  that  is  an  invariant  measure  for  the 
flow  : cf.  [49.  63].) 

Theorem  12.  Let  ,Y  be  an  a.p.  minimal  set  for  a flow  n and  let  p be 

the  Haar  measure  on  X with  p(X)  = 1.  Then  for  every  bounded  measurable 

function  /:  X -»  R one  has 

, | • 1 
| f(.x)n(Jx)  = hrn  | J (n(s,  x))  ds 

■ X I -x  ' 1 0 


for  almost  all  v e X . 
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These  are  two  examples  of  generic  theorems  that  we  shall  use  later.  Other 
examples  in  the  theory  of  topological  dynamics  and  differential  equations 
can  he  cited.  One  very  important  area  is  in  the  theory  of  structural 
stability,  which  is  given  in  the  chapter  by  Shub  in  this  volume. 

Let  us  conclude  this  section  by  considering  one  application  of  the 
Poincare  Caratheodory  recurrence  theorem  to  a control-theoretic  problem 
arising  in  celestial  navigation  [26].  The  physical  description  of  this  space 
rescue  problem  has  been  treated  elsewhere  [26,  64]  so  we  shall  turn  directly 
to  the  mathematical  description. 


Theorem  13.  Let  W be  a compact  Riemannian  manifold  and  let 
f(t,  v,  u)  be  a time-varying  ("‘-vector  field  depending  on  a control  parameter 
u e LI.  where  Q is  a compact  neighborhood  of  the  origin  in  Rm.  Assume  that 
/ and  its  covariant  spatial  derivative  are  uniformly  bounded  for  ( t . x,  u) 
e R x W x Q.  Let  A denote  the  hull  of  f where  u is  treated  as  an 
independent  parameter.  Define  the  attainable  set  A(t0,  x0;  t)  as  the  collec- 
tion of  all  </>([»],  t0.  x0;  t0  + t).  where  i i(t)e  Q for  l0  < t < t0  + r and  >p 
is  the  corresponding  solution  of 

x'  = /(f,  x,  u(0).  -v(f 0)  = x0 . (DE3) 

Assume  that 

divx  / (r.  x,  0)  = 0 on  R x W. 

and  that  there  is  an  >/  = r/(r ) > 0 such  that  for  all  (/0,  x0)e  R x W.  the 
attainability  set  /l(/0.  x0;  r)  contains  a ball  of  radius  //  centered  at 
'/>([<>]•  <o-  x o : t0  + t). 

Let  n be  any  invariant  measure  on  ft  with  //( ft)  — 1.  Then  there  is  a 
measurable  subset  6>  c ft  such  that  //( ft  — (0)  = 0 and  if/e  (0.  then  there  is 
a T > 0 such  that 

A(t0*  x0;  s)  = W for  all  s > T.  (7) 

The  conclusion  (7)  says  that  one  can  steer  from  the  initial  point  x0  (at 
time  /0)  to  any  point  in  the  space  W at  precisely  the  time  t = s.  In  the 
physical  problem  alluded  to  above,  this  means  that  one  can  effect  the  rescue 
of  any  derelict  space  ship,  provided  the  equations  of  motion  lie  in  the 
generic  space  63.  i.e.,  with  probability  1 rescue  is  possible.  The  proof  of  this 
is  based  on  an  application  of  the  abstract  Poincare  Caratheodory  recurrence 
theorem  to  the  How  n on  the  product  space  W x ft  together  with  the 
Fubini  theorem. 

Theorem  13  then  is  another  example  of  a generic  theorem.  It  is  also  a 
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good  example  of  a theorem  whose  proof  uses  the  construction  described  in 
Section  111  in  an  essential  way.  It  does  not  appear  that  Theorem  13  is 
provable  without  using  this  construction  (cf.  [64]). 

At  this  point  one  might  object  to  the  generic  theories.  The  vector  field 
/(f,  x.  u)  is  determined  completely  by  the  gravitational  field  of  the  large 
celestial  bodies  and  cannot  be  changed.  Therefore  one  might  ask,  “What 
happens  to  the  space  rescue  if  my  gravitational  field  / lies  in  the  negligible 
set  ft  — <&?”  The  mathematical  answer  would  be.  “Well  that  would  be 
unfortunate,  but  after  all  it  is  not  very  likely!”  This  answer,  while 
irrefutable,  is  not  very  reassuring  to  the  people  living  in  the  (ft  — (£>) 
worlds. 

However,  in  the  almost  periodic  case  there  is  hope!  One  cannot  prove 
directly  that  (ft  — CO)  is  empty  in  this  case,  since  the  construction  of  G is 
irretrievably  lost  in  the  proof  of  the  Poincare  C’aratheodory  recurrence 
theorem.  However,  one  can  adapt  a special  control  strategy,  which  works 
even  for  the  (ft  — CO)  worlds,  to  show  that  conclusion  (7)  is  valid  for  all 
j e ft.  This  strategy  uses,  in  an  essential  way,  Theorem  13  together  with 
certain  crucial  properties  of  almost  periodic  functions  (cf.  [26]).  So  Theorem 
13,  with  all  its  imperfections,  actually  saves  the  day  for  the  almost  periodic 
worlds ! 

Another  class  of  generic  theories,  which  we  shall  discuss  in  more  detail 
shortly,  concerns  the  behavior  of  solutions  of  linear  differential  equations 
(see  also  [40.  42,  43.  45,  47]). 


VIII.  Linear  Theory:  Dichotomies  and  Invariant  Splittings 

In  the  case  of  a linear  differential  system 

x'  = A(t)x , (DE4) 

where  x e X (here  X = Rn  or  C")  and  A e'H.  the  corresponding  How 

n(t,  x.  A)  = x,  A),  A,)  (8) 

has  the  property  that  the  mapping  </>(/.  x.  A)  is  linear  in  x.  Therefore,  one 
can  write 

</>(/,  x,  A)  = <l>(f,  A )x, 

where  <!>(/,  A)  is  a linear  transformation  on  X.  <!>(/,  A)  is  referred  to  as  the 
fundamental  solution  of  (I)H4).  Because  of  the  linearity,  the  How  n given 
by  (8)  is  called  a linear  skew-product  flow,  LSPF  for  short. 
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As  noted  above,  we  will  assume  that  the  matrix-valued  function  A(i), 
defined  for  — x < t < x,  is  a point  in  a function  space  91.  In  addition 
to  conditions  (i)  (iv)  of  Theorem  5,  throughout  this  section  we  shall  assume 
that  91  is  a compac  t set.  for  example.  91  may  be  the  hull  of  a bounded 
uniformly  continuous  function  A(t). 

Next  define  the  bounded  set.  the  stable  set.  and  the  unstable  set  by: 

9*  = {{x.  A)  : \<p{t.  x , A)  \ is  uniformly  bounded  for  t e R\, 

2 = {(x.  A) : \ cp(t,  x,  A)\  — ► 0 as  t -*  4-  x{, 

U = [ (x,  A ) : | cp(t,  x,  A ) | -*■  0 as  t -»  — cc  {. 

Also  define  the  fibers 

93(/l)  - {xe  X :(x.  A)  e «}, 

3(/f)  = {xe  X : (x.  A)e  2], 

U(/l)  = {xe  X : (x.  A)e  ll}. 

Because  of  the  linearity  of  (DE4),  the  fibers  93(4 ).  2(/l ).  and  \\(A ) are  linear 
subspaces  of  X . 

Definition  14.  The  LSPF  n given  by  (X)  is  said  to  admit  an  exponential 
dichotomy  at  A e 91  if  there  is  a projection  P = lJ(A)  on  X and  positive 
constants  a,  K such  that 

\<V(t.  A)P(A)<V  1 (.s,  /4)|  < Ke  s < /, 

|0*(r,  A )[/  - P(A)]<t>  1 (s,  A ) | < Ke  ^ t < s. 

It  is  shown  in  [54]  that  if  n admits  an  exponential  dichotomy  at 
A e 91.  then  Z(A)  is  the  range  of  P(A).  U(/l)  is  the  null  space  of  P(A).  and 
9H-4 ) - [0].  Furthermore,  in  certain  important  cases,  it  is  shown  that  if  n 
admits  an  exponential  dichotomy  at  A e 91  then  9f  is  trivial,  i.e., 
91  = {()]  x 91. 

The  converses  of  the  above  theorems  have  been  studied  recently  by 
Sacker  and  Sell  [54  56]  and  Selgrade  [60].  The  next  result  (see  [55])  shows 
that  the  assumption  that  93  be  trivial  leads  to  some  rather  surprising 
conclusions  about  the  flow  a{A,  x)  — Ax  on  the  base  space  91. 

Theorem  15.  Assume  that  93  = {()[  x 91,  where  91  is  compact.  Define 
9lk  = {A  e 91  : dim  3(/1 ) = k and  dim  U(/1 ) = n — k\. 


for  k = 0,  1,  ...,  n , where  n — dim  A'.  Then  each  9(k  is  a compact 
invariant  set  in  the  flow  o on  91.  Furthermore,  every  a-  and  w-limit  set  for 
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the  flow  n lies  in  precisely  one  31*  Finally,  either  31  = 3l4  for  some  k.  or 
there  are  at  least  two  nonempty  3lt's.  In  the  latter  case,  if 

L = min, A : 3lk  is  nonempty;, 
then  31,  is  a stable  attractor  for  the  flow  a. 

In  the  special  case  where  \M  is  a compact  minimal  set  one  can  say  more 
(cf.  [54.  60]) 

Theorem  16.  Assume  that  33  = ;b  x 31,  where  31  is  a compact  minimal 
set  Then  the  following  conclusions  are  valid 

(i)  There  is  an  integer  k such  that. 

dim  3(4)  = k and  dim  11(4)  = n — k 

for  all  -I  €31.  where  n = dim  .V.  and  .V  = 3(4)  4-  11(4)  for  all  f 31 

(n)  3(4)  and  11(4)  vary  continuously  in  4 €31.  i.e . the  mapping 

[J  .V  x 31  -*  .V  defined  by 

(x,a)-  P(4)x, 

where  P|4)  is  the  projection  on  .V  with  range  =3(4)  and  null  space 
= U(4),  is  jointly  continuous  in  (x.  4). 

(in)  The  rate  of  decay  in  3 and  II  is  exponential. 

(iv)  n admits  an  exponential  dichotomy  at  every  4 € 31  where  the 
associated  projection  P(4)  is  given  in  (ii). 

(v)  3 and  U are  subbundles  of  .V  x 31  and  .V  x 31  = 3 + U.  a Whitney 
sum 

Conclusion  (v)  is  simply  a reformulation  of  conclusions  (i)  and  (ii)  (see 
[6X]  for  more  details). 

The  compactness  of  the  base  31  is  crucial  in  the  theory  Very  little  could 
be  done  without  it.  For  example,  compactness  is  used  to  prove  the 
uniformity  of  various  growth  estimates.  The  stable  and  unstable  sets  3 
and  U are  defined  with  no  reference  to  the  rate  ol  decay,  but  in 
conclusions  (in)  and  (iv)  one  concludes  an  exponential  rate  of  decay, 
which  is  in  fact  uniform  over  31 

Several  generalizations  of  Theorems  15  and  16  are  possible.  First,  the 
theory  does  not  really  depend  on  the  differential  equation  (DF.4)  but 
instead  is  a theory  about  a linear  skew-product  flow,  that  is.  a flow  n on  a 
product  space  .V  x V 

n{t.  x,  y)  = (tpU.  x.  >).  tf(t,  y)). 
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where  the  second  coordinate  a depends  only  on  y (hence  a skew-product 
How)  and  the  first  coordinate  <f>  is  linear  in  v (hence  a linear  skew-product 
How)  (cl.  [ 14,  53,  54,  57]).  Second,  it  is  not  crucial  that  the  phase  space 
of  the  LSPI  n be  a global  product  space;  it  is  sufficient  if  the  phase- 
space  is  a vector  bundle,  which  is  locally  a product  space.  In  this  way  the 
theory  described  in  Theorems  15  and  16  extends  to  the  study  of  the 
linearized  flow  on  a tangent  bundle  TM  generated  by  a smooth  vector  field 
on  a compact  manifold  M (cf.  [54,  60,  68]  for  more  details). 


IX.  Asymptotic  Behavior  of  Linear  Equations 

Let  us  now  turn  our  attention  to  a linear  differential  system  with 
almost  periodic  coefficients: 

x' = A(t)x,  (DE4) 

where  x e X Let  denote  the  hull  of  A = /!(/)  and  let 

n(t,  v.  A)  = (<p(/,  x,  A),  A,)  (8) 

denote  the  corresponding  LSPb  on  X x S,M. 

I lie-classical  approach  to  the  study  of  the  asymptotic  behavior  of  solutions 
ol  (1)1.4)  has  been  in  terms  of  the  Liapunov-type  numbers,  or  the 
Liapunov  characteristic  exponents.  Lor  each  (x,  A)e  .V  x \U  with  x ^ 0,  the 
four  Liapunov-type  numbers  are  defined  by 

A)  = lim  sup  * log  \\<p(T,  x,  /1)||, 

7 - -f  or  ' 

A, f (x,  A ) = lull  inf  * log  \\(p(T,  x,  zl)||, 

r + ou  * 

(x,  A)  =■  lim  sup  * log  ||</>(T,  .x,  /1)||, 

/ ♦ - OD  ' 

/,  (x.  A)  = lim  inf  log  \\<p(T,  x,  A)\\. 
r-*  / ' 

Let  A,  '(A)  {/v'(x.  A):  x / 0}  and  similarly  define  A,  + (zt),  Av  (.4),  and 

A,  {A).  It  is  not  difficult  to  show  that  each  of  the  A(.4)-sets  contains  at 
most  n distinct  points,  where  n = dim  X. 

( )ne  ol  the  main  problems  arising  in  the  study  of  the  asymptotic  behavior 
ol  solutions  ol  (1)1.4)  is  to  determine  the  stability  or  continuity  of  the 
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A(.4)-sets.  In  general,  one  does  not  have  continuity,  hut  the  theory  is  very 
complicated.  We  refer  the  reader  to  [22  24.  37  4N]  for  more  details. 

Under  certain  circumstances  though,  one  can  prove  a generic-type 
continuity  for  certain  characteristic  exponents.  For  example.  Millionscikov 
[37.  40,  42.  43.  45.  4'']  has  used  somewhat  different  definitions  for  the 
characteristic  exponents  and  shown  the  continuity  of  A(.-l)  for  .4  belonging 
to  a generic  set  in  91.  Let  us  illustrate  this  with  Millionscikov's  theory 
of  the  probable  spectrum  [47], 

Let  n = dim  X.  Then  for  any  nonsingular  n x n matrix  B.  the  matrix  B*B 
(where  B*  denotes  the  adjoint  matrix)  is  nonsingular  and  self-adjoint. 
Furthermore.  B*B  has  precisely  n eigenvalues  (counting  multiplicity  ) all  of 

which  are  positive  real  numbers.  L.et  </,(#) </„(#)  denote  the  positive 

square  roots  of  these  eigenvalues  ordered  by  </,(#)  > ilz(B)  > •••  >dn(B). 

Now  consider  \ = .4(r).\,  where  .4  e9L  For  / = 1.2 n.  define 

1 s„‘ 

v,(.4)  = lim  lim  sup  ^ log  J,(<J>(r.  AJt))  . 

t — *■  x V-  - i j - o 

Since  «l»(r.  1)  is  continuous  in  1 and  the  functions  </,(B)  are  continuous,  it 
is  not  difficult  to  see  that  for  each  /.  the  function  v,(.4)  is  a hounded 
measurable  function  of  .4  e9L  where  the  measure  of  \’l  is  the  Haar  measure 
//.  Since  \’l  is  an  a.p.  minimal  set.  the  Birkhoff  ergodie  theorem  12  can 
be  applied,  and  we  conclude  that 


vi(  -4 ) = | vi(A)n(J.4)  = V.  (9) 

• 'll 

for  almost  every  .4  e 91.  The  collection  of  real  numbers  |v( vnJ  is  the 

probable  spectrum  of  .4.  We  see  then  that  the  set  { v t ( .4 ) \ „( .4 )[  is 

constant  over  the  subset  \M0  S 91  for  which  (9)  is  valid. 

More  recently,  a somewhat  different  theory  has  been  developed  by  Sacker 
and  Sell  (5Xj  (also  see  [60.  67])  to  analyze  the  asymptotic  behavior  of 
solutions  of  (DE4).  This  theory,  while  still  in  its  early  stages  of  development, 
appears  to  give  new  insight  into  the  structure  of  the  Liapunov-type  numbers. 
Let  us  now  look  at  the  details. 

For  /.  e K define  the  LSPF  nf  on  X x 91  by 

x,  A)  = (e  >(r.  v.  .4).  A,).  (10) 

It  is  elementary  to  show  that  for  each  n , is  a How  on  A-  x 91  For  each 
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/.  e R.  we  define  the  corresponding  bounded  set  2V , stable  set  3, . and 
unstable  set  U,  by 

'2V  = J(x.  A) : | x,  .4)|  is  uniformly  bounded  for  t e R J. 

3,  = {(x,  A)  : | e 'l(p(t.x,  /f)|-0as  t -+  + x}, 
ll,  = {(x,  A):\e  ;>(r,  x,  A)\ ->  0 as  t - - x }. 

Since  21  is  an  a.p.  minimal  set.  it  follows  from  Theorem  16  that  if 
2V  - |0]  x 21,  then  3/  and  ll;  are  invariant  subbundles  of  X x 21  and 

X x 21  = 3^  + U;  (Whitney  sum). 

We  define  the  resolvent  of  21  as 

p(2l)  = {keR  : 23.  = {0}  x 21}. 

The  complement  of  p(2l)  is  the  spectrum  of  21.  i.e., 

cr(2l)  = R - p(2l). 

The  next  theorem  describes  the  basic  properties  of  the  spectrum  a(2()  and 
the  corresponding  LSPF  n given  by  (8)  (cf.  [58]): 


Theorem  17.  Let  n be  the  LSPF  on  X x 21  defined  by  (8).  where  21 
is  a compact  minimal  set  and  dim  X > 1.  Then  the  spectrum  a(2l)  is  a 
nonempty  compact  subset  of  R consisting  of  k nonoverlapping  intervals 
[u,.  bj.  where  k < dim  X.  Moreover,  associated  with  each  spectral  interval 
[a,,  b,]  there  exists  a nonempty  invariant  subset  CO,  c:  X x 21  with  the 
following  property  : 

(i)  For  each  A e 21  the  fiber 

(S>i(A)  = {xeX:(x.A)e  CO,] 

is  a linear  subspace  of  X . 

(ii)  There  is  an  integer  > 1 such  that 

dim  CO, (/I ) = nt 

for  all  A e 21. 

(m)  (0,(/l ) varies  continuously  in  A e 21  in  the  sense  that  there  exists  a 

continuous  mapping  P,:  X x 21  -»  X given  by 

Pi(x,A)  = Pi(A)x, 

where  P,(A)  is  a projection  on  .V  with  range  =(0,(.4).  (This  means  that 
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each  (ft,  is  a nontrivial  invariant  suhhundle  of  X x \?l.)  Furthermore  one 
has  nx  + • • • 4-  nk  = dim  A'  and 

A = CO,  (.4)  + •••  + COJ/t ) 

for  every  A e sJl  (This  means  that 

X x = (&!+•••  + C0k 

is  a Whitney  sum.) 

What  does  this  result  tell  us  about  the  Liapunov-type  numbers?  Well  if 
(x,  ,4)e  (v»,  and  x ^ 0,  then  one  can  show  that  the  four  corresponding- 
type  numbers  //  (x.  A).  /.<  ’ (x,  .4).  /.s  (x.  .4).  and  /.,  (x.  .4)  lie  in  the 
corresponding  spectral  intervals  [u,,  />,].  Also  one  can  show  that  in  the 
autonomous,  or  periodic,  case  the  four  A(.4)-sets  are  the  same  and  they 

agree  with  (i)  the  probable  spectrum  [vq vk J as  well  as  with  (ti)  the 

spectrum  rr(\’l)  defined  above.  The  relationship  between  the  probable 
spectrum  and  rr('Jl)  in  the  almost  periodic  case  is  not  known,  although  it 
seems  reasonable  to  conjecture  that 

probable  spectrum  £ rr('?l). 
when  s?l  is  an  a.p.  minimal  set. 

The  study  of  the  asymptotic  behavior  of  solutions  of  (DE4)  in  the 
aperiodic  case  has  over  the  last  several  years  been  viewed  as  an  attempt 
toextend  the  Floquet  theory  to  nonperiodic  systems.  An  extensive  discussion 
of  the  " Floquet  problem  for  almost  periodic  systems”  can  be  found  elsewhere 
[67]  so  we  will  not  go  into  details  here.  Suffice  it  to  say  that  the  problem 
reduces  to  asking: 

Do  the  spectral  intervals  [a,,  6,]  reduce  to  points?  (II) 

An  affirmative  answer  to  this  would  be  particularly  fortunate,  because  in 
this  case,  whenever  (x.  .4)  belongs  to  the  associated  subbundle  CO,  (with 
x ^ 0).  then  the  four  Liapunov-type  numbers  must  agree  and  be  equal  to 
a,  ( = bj ).  In  other  words,  the  two  limits 

lim  ],  log  <p{T,  x.  A)  , lim  log  <p(T,  x.  A)  . 

7 - + x ' T-- * * 

exist  and  are  equal. 

!n  closing  we  should  note  that  it  is  not  at  all  clear  at  this  time  why  (II) 
should  always  have  an  affirmative  answer  in  the  almost  periodic  case. 
R McCiehee  has  constructed  a scalar  equation 

x'  = a(t)x 
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(where  xe  R a e AM.  and  \H  is  a compact  minimal  set)  with  the  property 
that  the  spectrum  rr('.M)  consists  of  a nontrivial  interval.  In  McGehee's 
example  the  function  a(t)  is  not  (in  fact,  it  cannot  he)  almost  periodic  in  I. 


X.  Semigroups  and  Integrodifferential  Equations 

We  wish  to  study  the  Hows  defined  by  (IE2)  for  the  case  where  the 
coefficients  of  (IE2)  are  linear  and  of  convolution  type,  i.e., 

t 

x'(t)  = Ax(t)  + I B(t  - s)x(.s)  ds  + F(t),  v(0)  = x0 , t > 0,  (1E8) 

• o 

where  Bel'( 0.  x).  If  x(t)  = T(r)  is  given,  TeEf-x,  0),  and  if  x(t) 
solves  the  system 

# t 

x'(t ) = Ax(t)  + [ B(t  - .v)x(.v)  ds  +f{t),  t > 0,  (IF.9) 

* - X 

then  x(r)  will  solve  (JE8)  with 

F(t)  =J  (t ) + | B(t  - s)T(.v)  ds,  x0  = T(0). 

* — X 

Barbu  and  Grossman  [3]  study  the  asymptotic  behavior  of  solutions  of 
(IE9)  by  using  that  equation  to  define  a C0-semigroup  and  then  applying 
the  theory  of  semigroups.  They  put 

X i = BC\(  — x,  0]  = {/e  G(-x,  ()]:/(  — x)  exists  and  is  finite). 

Given  T in  X x,  let  x0  = T(0)  and  let  </>(/.  x0,  V)  be  the  solution  of  (IF.9) 
with  x(t)  = T(/)  on  t < 0.  Define 

T(t )T  e X , by  T(t )T(.s)  = <p(t  + s,  x0 , T ) on  - x < s < 0. 

It  is  shown  in  [3]  that  7(f)  determines  a C0-semigroup  on  X v The  same 
construction  works  in  other  spaces,  e g.,  X p = X , n I!’(  — x,  0).  They  obtain 
exponential  estimates  on  growth  of  solutions  under  various  assumptions 
on  the  roots  of  the  determinant  det  (si  - A - B*(s))  = 0,  where  s is  a 
complex  variable  and  * denotes  Laplace  transform. 

Miller  [34]  employs  construction  (5)  of  Section  IV  in  order  to  construct 
a semigroup.  Let 

C 0(  — x,  0]  = {/:  ( — x,  0]  -»  R":  f is  continuous  and  has  compact  support). 
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WHO  = | Bit  - s)f(s)  ds,  t > 0,  fe  C0(  - x.  0]. 

* - X 

Let 

Y0  = {(f( 0),  </>(/)): /e  C0(  — x,  0]} 

he  the  linear  space  with  norm 

(./  (0),  </>(/))  = |/(0) | + sup  {|<p(/)(f)| : t > 0}, 

and  let  Y be  the  completion  of  Y0 . For  any  pair  (x0,  F)  in  K if  <p(t , x0,f) 
solves  ( I E 8 ),  then  (5)  becomes 

n(t,x0,F)=  |<p(f,  x0,  F),  F,()  + ) B(s  + ■ )(p(t  — s,  x0 , F)  d.vj. 

It  is  shown  that  7r(t,  x0F)  = T(t)(x0,  F)  determines  a C0-semigroup  on  Y. 
The  main  result  in  [34]  is  the  following: 


Theorem  18.  Let  6eL'( 0,  x)  and  let  Y,  n , and  T(f)  be  as  defined 
above.  Then  det  (si  - A — B*(s))  # 0 in  the  half  plane  Re  ,s  > 0 if  and  only 
if  T(t)  is  asymptotically  stable  in  the  sense  that 

(a)  sup  | T(t)  | : t > 0}  < x,  and 

(b)  for  any  y = (x,  F)  in  Y,  T(t)y  -» 0 as  t -*  x. 

Since  the  determinantal  condition  seems  to  be  the  standard  asymptotic 
stability  condition  for  (IE8)  (see,  for  example,  [35]),  then  its  equivalence 
with  (a)  and  (b)  is  a bit  surprising.  Normally  for  C0-semigroups  we  expect 
and  hope  for  exponential  decay  in  the  asymptotically  stable  case. 
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Here  we  consider  hyperbolic  systems  of  first-order  partial  differential 
equations  in  the  canonic  forms  of  Courant  Lax  and  of  Schauder.  For  these 
systems  in  a slab  Da  — [0  < x < a,  y e Er ] of  the  xy-space  £r+  \ we  formulate 
certain  boundary  value  problems,  and  we  state  corresponding  theorems  of 
existence  of  the  solutions,  and  of  their  uniqueness  and  continuous  dependence 
on  the  data  (Theorems  A and  B in  Sections  I and  2)  [2.  3].  In  particular, 
if  all  the  data  are  periodic  of  given  periods  in  the  y-coordinates,  then  the 
solutions  are  also  periodic  of  the  same  periods.  Theorems  A and  B contain 
a specific  hypothesis,  which  can  be  briefly  stated  by  saying  that  the  relevant 
matrices  have  "dominant  main  diagonal.”  In  Section  3 we  show  by  an 
example  that  the  conclusions  of  the  same  theorems  may  not  hold  if  such 
specific  condition  is  not  satisfied.  In  Section  5 we  describe  a boundary 
value  problem  of  the  above  type,  which  has  been  proposed  as  a simple 
preliminary  model  for  an  otherwise  extremely  complex  resonance 
phenomenon  of  nonlinear  optics.  By  a well-known  algebraic  process 
(Section  4)  we  then  show  in  Section  5 that  the  problem  can  be  reduced 
to  a Schauder  system  with  relative  boundary  conditions,  to  which  Theorem  B 
applies. 


1.  A Boundary  Value  Problem  for  Courant-Lax 
Hyperbolic  Systems 


Quasi-linear  hyperbolic  systems  of  first-order  partial  differential  equations 
are  often  written  in  the  Courant  Lax  canonic  form 


fajdx  + X Pr*(*.  T 

*=  i 


■)Wdyk)  =f{x,  y,  z). 


(x,  y)  e D,  i = 


* This  research  was  partially  supported  hy  AFOSR  Research  Project  71-2122  at  the 
University  of  Michigan 
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where  x,  >'  = (.Vi-.  • ••,  >’r)  are  the  r + 1 independent  variables,  r > 1, 
z = z(x,  y)  = (zj, zm),  (x,  y)  e D.  are  the  m unknown  functions,  and  plk , 
/,  are  given  functions  defined  in  D x Em. 

Here  we  assume  that  D is  an  unbounded  region,  namely,  a (thin)  slab 
D = la  x Er,  la  - [x|0  < x < a].  Instead  of  the  usual  Cauchy  problem  (with 
data  at  x = 0).  we  consider  here  the  following  more  general  boundary 
value  problems,  with  data  on  m,  in  general  distinct  hyperplanes  x = at, 

0 < < a,  / = 1 m (though  we  do  not  exclude  that  some  or  all  of  the 

a,  coincide). 


I.  Let  a, , 0 < Uj  < a,  i = 1 m,  be  arbitrary  numbers,  let  i A,(y),  y e £r, 

i — 1 m,  be  given  functions,  and  let  us  determine  a solution  z(x,  y)  = 

(- zj,  (x,  y)  e la  x £r,  of  (1 ) satisfying 

z,(tf,  , y)  = iA,(y).  y e £r.  / =1 m.  (2) 

In  particular,  if  a,  = 0,  / = 1 m',  u,  = a.  i = m + 1,  ....  m,  0 < m < m. 

problem  (2)  reduces  to  the  problem  of  assigning  tri  of  the  functions  z,  on 
the  hyperplane  x = 0 and  the  remaining  functions  z,  on  the  hyperplane 
x = a.  If  iij  — (),/  = 1,  ....  m,  problem  (2)  reduces  to  the  Cauchy  problem. 

We  shall  also  consider  the  following  more  general  problem. 

II.  Let  <!,,()  < a,  < a,  i — 1 m,  be  arbitrary  numbers,  let  i A,(y),  c‘i7-(y), 

ye  £r.  i,  j = 1,  m.  be  given  functions,  and  let  us  determine  a solution 
z(x,  y)  = (z, zm),  (x,  y)  e D = la  x £r,  of  ( 1 ) satisfying 

m 

X . y)  = •l'i(y)’  > = i m.  (3) 

j=  i 

In  other  words,  we  assign  m distinct  linear  combinations  of  the  z,  with 
coefficients  depending  on  y,  on  m different  hyperplanes  x = at. 

Problem  II  reduces  to  problem  I when  [c,y(y)]  is  the  identity  matrix 
[ty(>’)]  = = 0,  for  /#;,  1, 7 = 1 m. 

We  shall  consider  here  problem  I in  all  its  generality,  and  problem  II 
when  [f(y(y)J  is  a matrix  with  “dominant  main  diagonal.”  Since  each 
equation  in  (3)can  be  multiplied  by  arbitrary  non/ero  factors,  it  is  convenient, 
for  the  sake  of  simplicity,  to  express  this  condition  in  the  following  form: 
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There  is  a number  au . 0 < a0  < I.  such  that  for  c,j(y)  ~ + c,.(y),  we  have 

m 

X |cy(.v)l  <o(J  < l ye  E\  /=! m.  (4) 

j=  i 

If  [c,J  is  the  identity  matrix,  that  is.  for  problem  I.  we  have  c,  (y)  — 0 and 
we  can  take  a0  = 0. 

The  problem  of  prescribing  the  values  of  the  functions  zt  on  ah  distinct 
hyperplanes  x = a,  (problem  I ).  or  of  m linear  combinations  of  them  on  the 
same  hyperplanes  (problem  II).  for  hyperbolic  systems  at  the  level  of 
generality  that  we  shall  take  into  consideration  is  rather  new.  Problem  I 
for  very  particular  systems  was  considered  by  Niccoletti  in  1X97,  and  more 
recently  was  mentioned  by  Hukuhara.  Aspects  of  these  problems  were 
discussed  later  by  different  authors  (see  references  in  [3]). 

Let  a0  > 0.  > 0 be  given  constants,  and  let  /i(i . denote  the  intervals 

/„,  - [x|0  < x < a0]  c E\  = [ — Q.  Q]m  c Em.  We  denote  by  |y|,  \z\  the 

norms  in  Er  and  Em  defined  by  |y|  = max[|yJk|,  k = 1 r],  \z\  - 

max  [ |r,|,  / = 1 m]. 

By  repeated  use  of  Banach's  fixed-point  theorem  we  have  proved  in  [2] 
the  following  theorem  of  existence,  uniqueness,  and  continuous  dependence 
on  the  data  for  system  (1)  with  boundary  conditions  II. 

Theorem  A.  Let  plk(x,  y,  z),  /,(x,  y,  z),  / = 1,  ....  ah,  k=  1 a-,  be 

functions  defined  in  lcl  x Er  x i2.  measurable  in  x for  every  (y\  r).  continuous 
in  (y,  z)  for  every  x,  and  let  us  assume  that  there  are  nonnegative  functions 
ah(x).  /(x),  h(x).  / , (x),  0 < x < Uq  , ah,  /,  ai,  /,  e L][0,  «0],  such  that,  for  all  (x,  y.  z), 
(x,  y,  z)e  /„  x E'  x fi,  i = 1,  ....  ah,  k = 1,  ...,  r,  we  have 

|p/*(x,  y,  z)|  < ah(x ),  |/((x,  y,  z)|  < h(x), 

|p.*(x,  y,  z)  - pjx,  y,  z)\  < /(x)[|y  - y|  + |z  - z|], 

I /.(x,  y,  z)  -f(x,  y,  z)|  < /i(x)[|y  - y|  + \z  - z\\ 

Let  Cjjiy),  y e Er,  i,  j = 1, . . . , ah,  be  given  functions,  and  let  i s assume  that 
there  are  constants  0 < o0  < 1,  L.  A > 0,  such  that,  for  all  \.  ve  Er  and 
/,  j = 1 ah,  we  have 

ki>(y)|  ^ r,  \ctJ(y)  - Cij{y)  I < A | y — y | , 

and  (4)  holds  Let  i/a ,(y),  ye  Er,  i = I ah,  be  given  functions  continuous 

in  E'.  and  let  us  assume  that  there  are  constants  <u0 , A0  > 0,  such  that  for 
all  y,  y e Er  and  i = 1 m,  we  have 

(»0  < (I  - AT 0)J2.  |i/a,(v)|  < (»o,  k/(y)  - <A,(y) | < A0|y  - y|. 
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Then  there  is  a constant  Q > 0 and  a number  a,  0 < a < a0 , such  that  for 
any  system  of  numbers  a,,  0 < a(  < a,  / = 1,  m,  there  are  functions 

z(x,  y)  = (zj,  zj,  (x,  y)  = (x,  y, yr)  e lu  x E\  continuous  in  lu  x £r, 

absolutely  continuous  in  x for  every  y,  such  that  for  all  (x,  y),  (x,  y)  e la  x Er 
and  / = 1,  m,  we  have 

|z,-(x,  >’) | < ii,  |z/(x,  y)  - z,(x,  y)|  <Q\y-y\, 

and  the  functions  zf  satisfy  (3)  everywhere  in  Er,  and  equations  (1)  almost 
everywhere  in  lu  x Er. 

A proof  of  this  existence  theorem  is  given  in  [2]  together  with  estimates 
of  the  value  of  a,  0 < a < a0,  which  depends  only  on  the  constants  aj0 . ii. 
o0 , A,  T,  A0,  and  given  functions  m,  /,  n,  /,.  We  also  prove  in  [1]  that  the 
solution  z(x,  y)  = (z{,  . zm)  of  the  problem  (1),  (3)  is  unique  in  the  class 
designated  in  [2]  and  depends  continuously  on  ip  = i J/m)  in  the 

uniform  topologies. 

Whenever  all  functions  p,*(x,  y,  z),/(x,  y,  z),  i pt(y),  c(J(v),  are  periodic  of 
period  7,  > 0 in  each  variable  ys,  s = 1,  r,  then  the  unique  solution 
z(x,  y)  — (z [, zm)  is  also  periodic  of  the  same  period  Ts  in  each  variable 

, .v  = 1 r. 


2.  Extension  of  Theorem  A to  Schauder’s  Systems 


A canonic  form  more  general  than  (1 ) for  quasilinear  hyperbolic  systems 
was  proposed  by  Sehauder: 


m 


I Au(x,  y,  z) 


ilzj/dx  + X Piki*'  T z)((1Zj/dyk) 


=f(x,  y,  z). 


i=  i 


*=  i 


(x,  y)  e D,  i = 1, ....  w,  (5) 

where  x,  y = (y,, . . . , >;)  are  the  independent  variables,  z(x,  y)  = (zj, . . zm) 
the  m unknown  functions,  where  p,k(x,  y,  z),  J\(x,  y,  z)  are  as  in  Section  1, 
and  Atj(x,y,z) are  given  bounded  functions  defined  in  I)  x with  det  [/1,J  > 
H > 0,  fi  a constant.  For  [^^(x,  y,  z)]  = [^,J,  the  identity  matrix,  system  (5) 
reduces  to  system  (I ). 

We  shall  assume  as  before  that  D is  a (thin)  slab  D = la  x Er,  and  we 
consider  for  (5)  the  same  boundary  value  problems  1 and  II  of  Section  1. 

In  order  to  present  a general  theorem  of  existence,  uniqueness,  and 
continuous  dependence  on  the  data,  we  assume  that  both  matrices 
[Aijix,  y,  z)]  and  [<  ,/>’)]  have  “dominant  main  diagonal”  in  a sense  that  will 
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be  explained  below.  First,  as  in  Section  1,  let  Cy(y)  = 6^  + ctj(y),i,j  = 1 , 
and 

m 

a0=  Max  Sup  £ |c,/y)|,  (6) 

i=l,  . m y e E'  j = I 

so  that  er0  = 0 when  [c.J  is  the  identity  matrix.  Concerning  the  matrix 
y,  z)]  first  we  denote  by  the  cofactor  of  Atj  divided  by  det  [A^], 
that  is,  y.,j  = (A  i)ji  and  let 

Au(x,  y,  z)  = + A^x,  y,  z),  otu(x,  y,  z)  = 6tj  + afJ(x,  y,  z), 

m 

<*i  = Max  Sup  XI  Aih(x,y,z)\, 

i = 1 m (x,  y,  z)  e D x ft  h = 1 

m 

a2=  Max  Sup  X I“m(*.  ^ z)l>  (7) 

i = 1 m (x,  y,  z)  e D * 12  /i  = 1 

m m 

a3  = Max  Sup  X Z y.*)|  |^i.(x,  y,  z)|, 

i=l m (x,  y,  z)  € D * ft  s = 1 h = 1 

( T = (7  j -f“  2 ^ 3 - 

Then  (7  > 0,  and  <7  = 0 when  A is  the  identity  matrix.  In  the  existence 
theorem  below,  we  shall  assume 

(7  + (70  + (7(70  <1.  (8) 

Thus,  if  [Atj]  is  the  identity  matrix  [system  (1)],  then  a = 0,  and  all  we 
need  is  that  < 1.  If  [c,J  is  the  identity  matrix  (problem  I),  then  a0  = 0, 
and  all  we  need  is  that  a < 1. 

By  repeated  use  of  Banach’s  fixed-point  theorem  we  have  proved  in  [3] 
the  following  theorem  of  existence,  uniqueness,  and  continuous  dependence 
on  the  data  for  system  (5)  with  boundary  conditions  II. 

Theorem  B.  Let  /4fj(x,  y,  z),  i,j  = 1,  — m,  be  continuous  functions  on 
la„  x Er  x Q with  det  [/f,J  > n > 0,  /c  a constant,  and  let  us  assume  that 
there  arc  constants  H , C>0  and  a function  m(x)  > 0,  0 <x<u0, 
m e L,[ 0,  a0],  such  that,  for  all  (x,  y,  z),  (x,  y,  z),  (x,  y,  z)e  I an  x £r  x ft  and 
all  i,j  = 1,  m,  we  have 

M0(x,  y,  z)|  < //, 

I Ajj(x,  y,  z)  - /4,j(x,  y,  z)|  < C[|y  - y|  + \z  - z|], 

X 

Mq(x,  y,  z)  - A^x,  y,  z)|  < I m(v)  dx  . 


256 


LAMBERTO  CESARI 


If  ay(x,  y,  z)  denotes  the  cofactor  of  A(j  in  the  matrix  [A^]  divided  by 
det  [Ay],  that  is,  = (A~  ‘);i,  then  certainly  there  are  constants  //',  C'  > 0, 
and  a function  m(x)  > 0,  0 < x < a0,  m e L,[0,  a0]  such  that,  as  above, 

K(x’  y*  z)l  ^ H ' 

| ay(x,  y,  z)  - a0(x,  y,  z)|  < C'[|y  - y|  + |z  - z|], 

X 

|a0(.x,  y,  z)  - a0(x,  y,  z)j  < j m (a)  da  . 

* X 

Let  pik(x,  y,  z),/i(x,  y,  z),  i = 1,  . . . , m,  k = 1,  . . . , r,  be  functions  defined 
in  IUq  x Er  x Q,  measurable  in  x for  every  (y,  z),  continuous  in  (y,  2)  for 

every  x,  and  let  us  assume  that  there  are  nonnegative  functions  m(x),  /(x), 

n(x),  l j(x),  0 < x < a0,  m,  l,  n,  lY  e L,[0,  a0],  such  that  for  all  (x,  y,  z), 

(x,  y,  z)  e IUn  x £r  x Q,  ( = l,  m,  k = 1,  . . . , r,  we  have 

I P,k(*,  y.  z) | < m(x),  | f(x,  y,  z)  j < n(x), 

I pik(x,y,z)  - pik(x,  y,  z)|  < /(x)[|y  - y|  + |z  - z|], 

I y«-(x,  y,  z)  —fi(x,  y,  z)|  < /a (x )[  | y — y|  + |z  - z |], 

Let  ^,(y),  c,7(y),  ye  F,  i,  j = 1,  •••,  be  given  bounded  continuous 
functions  in  £r,  and  let  us  assume  that  there  are  constants  o0,  ro,  A0,  r0, 
0 < o)0  < Q,  A0 , t0  > 0,  such  that  for  all  y,  y e Er  and  i — 1, . . . , m,  we  have 

I «Ai(y)  I |*A«(>’)  — *A,(y)l  <A0|y-y|, 

m 

| c,j(y)  I < r0 , II  C0(y)  - cfj(y)  | < t0  |y  - y \ . 
j=  1 

With  the  notations  (6)— (8)  (with  a = u0)  we  also  assume  that  cr  + <r0 
+ aa0  < 1,  and  then  H < 1 + <r,  H'  < 1 + er,  ro  < 1 + <70 . 

Then,  for  a,  ru0 , r 0 , C,  C'  sufficiently  small,  0 < a < a0 , fu0 , t0 , C,  C'  > 0, 
and  for  every  system  of  numbers  at,  0 < at  < a,  i — 1,  ...,  m,  there  are  a 
constant  Q > 0,  a function  y(x)  > 0,  0 < x < a,  y e £ ! [0,  a],  and  a vector 
function  z(x,  y)  = (zt,  zm),  (x,  y)e  la  x £r,  continuous  in  la  x £r.  satis- 
fying (3)  everywhere  in  £r,  satisfying  (5)  almost  everywhere  in  Ia  x £r,  and 
such  that,  for  all  (x,  y),  (x,  y),  (x,  y)  6 la  x £r  and  i = 1 m,  we  have 

| Z|(x,  y)|  < fi,  | Zj(x,  y)  - z,(x,  y)|  < Q|y  - y|, 

. X 

|z,(x,  y)- zf(x,  y)|  < | y(a)  dot  . 

J X 

The  function  z(x,  y)  is  unique  and  depends  continuously  on  ij/(y)  = 
(1  //,, t//m)for  zand  1//  in  classes  described  in  [3],  Also,  computable  estimates 
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of  a , oj0  , C,  C',  t0  are  given  in  [3],  which  depend  only  on  the  constants  Q, 
A0,  a,  ct0,  and  on  the  functions  m,  m\  m,  n , /,  /j,  hut  not  on  the  numbers 
a4 , 0 < a,-  < a,  i = 1 m. 

Again,  as  in  Section  1,  whenever  all  functions  /4y(x,  y,  z),  plk(x,  y,  z), 
_/j(x,  y,  z),  iAi(y),  Cjj(y)  are  periodic  of  period  Ts>  0 in  each  variable  ys, 
.s  = 1,  r,  then  the  unique  solution  z(x,  y)  = (zj,  zm)  is  also  periodic 
of  the  same  period  Ts  in  each  variable  y4,  s = 1,  r. 

3.  A Counterexample 

We  show  here  by  an  example  that  the  uniqueness  parts  of  Theorems  A 
and  B may  not  hold  when  the  “dominant  main  diagonal”  conditions  are 
not  satisfied. 

Let  m = 2 ,r  = l,pl5p2  constants,  px  # p2,[c,v]  = [1,  1;  1,  - 1 ]J\  =f2  = 0, 
t//,  = 1 1/2  = 0,  so  that  problem  (1),  (II)  reduces  to  the  linear  homogeneous 
problem  in  the  strip  0 < x < — oc  < y < + oc : 

dzjdx  + p,  dzjdy  = 0,  dz2/dx  + p2  czjdy  = 0, 

z,(0,  y)  + z2(0,  y)  = 0,  zx(a , y)  - z2(a,  y)  = 0, 

This  system  has  x-many  solutions.  In  particular,  the  following  family  of 
trigonometrical  polynomial  solutions  is  trivially  singled  out: 

N 

2,(x,  y)  = X sin  [(2s  + l)nb~  l(-pxx  + y)], 

s = 0 

s 

Z2(x , >)  = - X c's  sin  [(2s  + l)7i6_  *(-p2x  + y)], 

s = 0 

where  h = (px  - p2)u,  cs  arbitrary  constants.  Note  that  here  we  can  take 
a > 0 as  small  as  we  want. 

4.  Reduction  of  Hyperbolic  Systems  in  Two  Independent 
Variables  to  Schauder’s  Canonic  Form 

Quasilinear  hyperbolic  systems  in  m > 1 unknowns  W = (Wx,  Wm) 
and  two  independent  variables  x,  /,  of  the  form 

X K(x,  f,  W)(dWjldx)  + b^x,  t,  WfdWJdt)]  = Cj(x,  u w\ 

j=  i 

i = 1, m,  (9) 
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can  be  reduced  solely  by  algebraic  operations,  as  is  well  known,  to 
Schauder’s  systems  (5). 

Indeed,  let  us  denote  by  A(p)  the  m x m matrix  A(p)  = [a^p  — btj,  i,  j = 
1,  ....  m],  and  by  pl5  ....  pm  the  m roots,  distinct  or  coincident,  of  the 
algebraic  equation  det  A[p)  = 0 (the  characteristic  equation ).  For  every  root 
p of  this  equation,  we  denote  by  M the  usual  algebraic  multiplicity,  and 
by  p,  1 < p < M,  the  geometric  multiplicity,  that  is,  the  dimension  of  the 
linear  space  of  all  vectors  h — col  (hu  hm)  satisfying  (A(p))_1h  = 0,  or 

m m 

P Z “ij  hi  - z hijhi  = °'  i= 

i = 1 i = 1 


We  assume  here  that  the  roots  p of  the  equation  det  A(p)  = 0 are  all  real, 
not  necessarily  distinct,  but  for  each  of  them  geometric  and  algebraic 
multiplicities  coincide,  or  p — M.  Then  we  can  locally  determine  systems 

hr  = col  (hrl,  hrm ),  r = 1 m.  of  independent  vectors  such  that 

( A(pr))_1hr  = 0,  r = 1,  ....  m.  The  last  hypothesis  we  need  is  that  we  can 
choose  vectors  hr , r = 1, . . m,  as  above  in  such  a way  that  det  (A?x,  hm)  ^ 0 
in  the  entire  domain  covered  by  (x,  f,  z).  Then  system  (9)  is  equivalent  to  the 

one  we  obtain  by  taking  linear  combinations  of  them  with  factors  /irl /irm, 

and  then  letting  r run  from  1 to  m.  In  other  words,  we  have  the  system 

mm  m 

Z /!ri  Z MdWj/dx)  + bi}((:Wjldt)\  = £ /iric,,  r = 1 m. 

i = 1 j = 1 i = 1 


or 


z 

J- 1 


Ki « i} 


i = 1 


(<  Wj/cx ) 


m 


= Z hri  Ci , 


r = 1 w. 


where  now  pr  , atjhri  = i have  now  the  Schauder  system 


£ )(«  Wfix  + p,  d wjst)- 

]=  1 * i=  1 / 


m 


Z hrlci. 


or 


lA,l{ewl/ex  + p,ewt/et)  = F„ 


r = 1, ....  m. 
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5.  A Hyperbolic  Problem  in  Nonlinear  Optics 

For  a problem  of  nonlinear  optics  (see  Remark  1 ),  the  following  system 
of  nonlinear  Maxwell  equations  with  boundary  conditions  has  been 
proposed: 

— cH/dx  - k2  cE/dt  + i:E  dE/ct , — cE/cx  = cH/ct , 

0 < x < a,  - cc  < t < co,  (10) 
£(0,  t)  + H(0,  t)  = 2 E(a,  t)  - H(a , t)  = 0,  - oo  < t < oo, 

where  k > 1 is  a known  constant,  £ > 0 and  a > 0 are  “small”  constants, 
and  oj(t),  —oo<t<  + x,  is  a known  smooth  periodic  function  of  given 
period  T.  Relations  (10)  have  been  made  dimensionless  by  choosing  suitable 
units.  This  is  a simple  reduction  of  problem  (10)  to  a Schauder  system  (5) 
with  boundary  conditions  II,  satisfying  the  “dominant  main  diagonal 
condition.”  By  changing  E,  H into  k~lu  and  v,  and  by  changing  t into 
t — k~t,  problem  (10)  becomes 

ux  + v,  = 0,  vx  + ( 1 + e'u)u,  = 0, 
k ~ 1 u(0,  t)  + l(0,  t)  = 2 w(t),  k~  'u(a,  t)  - i(a,  f)  = 0, 

where  c'  = i.k  2 and  we  have  simply  written  t for  7.  By  introducing  new 
unknowns  U = 2“'(u  + r),  V — 2 ~l(u  — v),  so  that  u = U + V,  v = U — V, 
problem  (11)  becomes 

ux  + K + u,  - v,  = o, 

Ux  — Vx  + (l  — k'u)U,  + (1  + e'u)V,  = 0, 

2 _ 1 (A: _ 1 + 1 )U(0,  t)  + 2~  l(k~  1 - 1)F(0,  t)  = oJ(t), 

2~'(k~  1 - l)U(a,t)  + 2-'(k-'  + l)F(a,f)  = 0, 

whereu  = U + V System  (12)  is  of the  form  (9)  with  an  = 1 ,a12  = Fhn  = 1, 
b i 2 = — 1,  Cl  2 1 = F^22=  = ^ F-  U,  b 22  — 1 + £U. 

The  characteristic  equation  is  now  det  A[f))  = 0,  or 

det  [p  — 1,  p + 1 ; p — (1  4-  r.’u),  — p — (1  + e'u)]  = 0, 

or  p2  = 1 + £ u,  u = U + V.  For  1 1/|,  | V\  < M and  0 < £'  < 1/2M,  the  roots 

are  real  and  distinct,  p,  = (1  4-  £'u)‘  2,  p2  = “U  + e M)’  2-  now 

choose  nonzero  real  vectors  hi  = col  (/j,  ,,  /i12),  h2  = col  (h2 j,  Ii22)  such  that 
(4(pr))_  jlir  = 0,  r = 1,  2,  that  is, 

(pi  - l)/in  + (p,  - (1  + e'u))/i12  = 0, 

(p  i + I 1 1 + ( — P x — (1  + e u ))b  1 2 =0, 
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and  analogous  equations  hold  for  h2l,  k22  with  f>2  replacing  p,.  We  take  here 

hll  = (e'u)  '(1  + e'u  — px),  hl2  = (e'u)~  ‘(Pi  - 1), 

/i21  = (2  + e'u  - 2 p2)  '(1  + e'u  - p2),  /i22  = (2  + e'u  - lp2)  l(p2  - 1). 

A = det  [/?,,.  hl2  ; h2l,  h22 ] = -2(1  + e'u)1  2((1  + e'u)1  2 + 1)  2- 

Thus,  for  e = 0.  A = — 2 1 # 0,  and  for  1 1/|,  | K|  < M and  e > 0 sufficiently 
small,  we  also  have  A ^ 0.  System  (21)  is  now  equivalent  to  the  system 

/jri(L'x  + Vx  + U,  — V,)  + hr2  ( T x — K + (1  + e'u)Lt  + (1  4-  eu)Vt)  = 0. 

r = 1,2, 

or,  after  manipulations,  to  the  Schauder  system. 

(Ux  + P\  K)  + 0(vx  + p{  vt)  = o,  (ux  + p2  u,)  + (Vx  + p2  K)  — O' 

2 l(k  1 + 1 )t/(0.  t)  + 2~  l(k~  ] - 1)F(0.  t)  = oj(t), 

2'M  k 1 - 1 )L  (a.  t)  — 2~  l(k~ 1 + 1 )V(a,  t)  = 0, 

where  0 = [(1  + e'u)1  2 — 1 ]/[( 1 + e'u)1  2 + 1],  The  boundary  data  are  of 
type  II  with  c, , = c22  -2  1 ( A:  1 + 1),  c12  = c21  = 2”  M k~  1 - 1);  hence 
ctj  = Cjj  — = 2"  ’(/c-  1 — 1).  rr0  = 1 - k~  1 < 1.  The  coefficients  of  the 

system  are  [AtJ\  = [1,  0;  0,  1],  and  for  e'  = 0.  0 = 0 and  [/lu]  = [di;],  a = 0. 

Thus,  for  |(/|,  l^l  <M  and  e>0  sufficiently  small,  we  certainly  have 
a 4-  o0  + nnQ  < 1 . 

Remark  1.  If  a monochromatic  strong  laser  beam  is  focused  on  a thin 
crystal,  then  after  emerging  the  light  shows  a measurable  component  of  the 
second  harmonic  radiation.  For  instance,  a ruby  laser.  6940  A.  gives  rise  to 
the  3470  A component.  Franken  and  Ward  [4]  discuss  at  great  length  the 
experimental  and  theoretical  basis  of  this  phenomenon.  (See  also  Bloem- 
bergen  [1].)  In  his  lecture  [5],  Graffi  considered  the  main  nonlinear  problems 
of  electromagnetism,  and  particularly  those  of  nonlinear  optics.  From  Graffi’s 
exposition,  as  well  as  from  those  of  the  authors  mentioned  above,  it  appears 
that  Maxwell  equations  (10)  represent  the  simplest  model  for  a preliminary 
study  of  phenomena  of  nonlinear  optics,  otherwise  an  extremely  complex 
field.  The  boundary  conditions  for  the  plane  waves  in  (10)  can  be  justified 
as  follows  (from  correspondence  between  Graffi  and  the  author).  If  the 
crystal  in  the  xyz-space  is  the  slab  0 < .v  < a.  then  in  the  empty  space 
— x < x < 0 the  linear  Maxwell  equations  —//,  = £,,  —EX  = H,  have 
solutions  E(x,  t)  = </>(.x  + t)  + ip(x  - l),  H(x,  t)  = — <p(.v  + /)  + i J/(x  — t ). 
1 1/  a known  progressive  wave.  a regressive  wave,  and  thus  E(0.  t)  + H{ 0.  t)  = 
2i !/(  — <)■  F°r  u < v < -f  x.  again  E(.v,  r)  = <p(x  + t)  + i/ffiv  - f),  H(x,  t ) = 
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— </>(x  + r)  + tf(x  - r).  with  <p  — 0 (no  regressive  wave),  and  then  E(a , /) 

— H(a,  t)  = 0.  If  we  assume  that  £ and  //  are  continuous  across  the  surface 
of  the  crystal,  then  we  have  the  boundary  conditions  £(0,  /)  + //(().  t)  - 
2 oj(i).  E(a.  t)  - H(a , f)  = 0 for  the  nonlinear  Maxwell  equations  (10)  at 
x = 0 and  x = a,  with  oj(t)  — t/z(  — t).  The  study  of  problem  (10),  with 
boundary  conditions  assigning  the  values  of  linear  combinations  of  £ and  // 
on  the  surface  of  the  slab,  was  the  motivation  of  our  work  in  [2.  3],  some 
results  of  which  we  have  presented  in  Sections  1 3. 

Remark  2.  The  same  equations  (10)  in  usual  units  are  — Hx  — 
b2(E,  + E EE,),  - Ex  = h2H , in  the  crystal,  that  is,  for  0 < x < a (and 

— Hx  - £,£,,  — Ex  = nH,  in  the  empty  space,  and  thus  for  x < 0,  and  for 
x > a).  Here  e,.  f;2,  nx,  //2  are  the  dielectric  constant  and  the  magnetic 
permeability,  in  the  crystal  and  in  the  empty  space,  respectively.  £,  <e2, 
HX=H2-  Corresponding  boundary  conditions  (10).  and  reduction  to  a 
Schauder  system  (5).  with  boundary  conditions  II.  satisfying  (8),  valid  for 
all  values  of  the  constants  ex  < e2,  nx  — n2-  and  £ > 0 sufficiently  small, 
will  be  discussed  elsewhere  in  papers  by  the  author  and  Bassanini. 

For  boundary  conditions  of  the  Leontovich  SchelkunofT  type  in  other 
problems  of  electromagnetism,  compare  Schelkunoff  [8]  and  Ciraffi  [6],  For 
other  studies  of  nonlinear  Maxwell  equations,  compare,  e.g.,  Rivlin  and 
Venkataraman  [7], 
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In  recent  years  several  authors  have  used  techniques  based  on  Liapunov’s 
second  method  to  investigate  stability  phenomena  for  partial  differential 
equations  [1-4,  6 9]. 

The  first  step  in  such  an  investigation  is  to  interpret  the  given  partial 
differential  equation  as  a dynamical  system  on  some  appropriate  phase  space. 
The  remaining  part  of  the  investigation  is  to  use  the  already  existing 
Liapunov  theory  for  general  dynamical  systems  [10-12]  to  help  determine 
the  stability  properties  of  the  particular  dynamical  system  at  hand.  This 
Liapunov  theory  was  developed  mainly  in  the  context  of  ordinary  and 
functional  differential  equations.  Applying  it  to  the  study  of  partial  differential 
equations  seems  to  involve  serious  difficulties,  but  the  articles  already  cited 
[1-4,  6 9]  indicate  that  success  is  possible. 

The  subject  of  this  chapter  is  some  of  the  author’s  recent  work  involving 
the  procedure  just  indicated.  This  work  concerns  the  following  equations: 


u,(x,  t)  = uxx(x,  t)  + /(x,  u(x,  0), 

0 < X < 71,  0 < t < s, 

(la) 

ux(0,  t)  = ux(n,  t)  — 0, 

0 < t < s. 

(lb) 

u(.v,  0)  = 0(x), 

0 < X < 7T. 

(lc) 

Here, /is  a given  C2-smooth  function  mapping  [0.  tt]  x R into  R.  where  R 
denotes  the  real  number  system.  The  initial  datum  $ is  any  C'-smooth 
function  mapping  [0,  7t]  into  R such  that  tp'( 0)  = (j)'(n)  = 0.  Equations  (1) 
are  to  be  solved  for  a function  u continuously  mapping  a domain  of  the 
form  [0,  7t ] x [0,  s),  0 < s < lx,  into  R such  that 

(i)  ux  exists  and  is  continuous  on  [0,  n\  x [0,  s); 

(ii)  u,  and  uxx  exist  and  are  continuous  on  (0,  n)  x (0,  s); 

(iii)  u satisfies  Eqs.  (1)  on  the  domains  indicated. 

We  are  interested  in  the  asymptotic  behavior  of  solutions  of  Eqs.  (1). 
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In  a separate  article  [5]  we  have  given  a detailed  report  of  our  investiga- 
tions for  Eqs.  (1).  Here  we  shall  only  describe  some  of  our  more  salient 
results. 

Let  X be  the  space  of  all  C1  -smooth  functions  0:  [0,  n]  -»  R such  that 
0 (0)  = 0'(7r)  = 0.  For  each  0 e X we  let  0 ,0)  = sup  { |0(x)| : 0 < x < n\ 
and  0 = 0 l0*  + sup  { |0'(x)| : 0 < x < n).  The  space  X as  normed  by 

will  be  our  phase  space  for  Eqs.  (1).  The  norm  <0)  will  only  play 
an  auxiliary  role. 

For  any  0 e X,  Eqs.  (1)  have  a unique  noncontinuable  solution  u(0) 
defined  on  a domain  [0,  7t]  x [0,  s(0)),  where  0 < .s(0)  < + x.  For  each 
x e [0,  7t],  / e [0,  ,v(0)),  we  let  u(x,  t:  0)  denote  the  value  of  u(0)  at  (x,  t). 
Thus,  we  obtain  a strongly  continuous  semigroup  {U(t)\  on  X by  setting 
L’(/)0  = m(\  r;  0)  for  all  0 € X.  t e [0,  .v(0)). 

For  any  0 e X we  let  y(0)  denote  the  orbit  of  i/(0),  that  is,  y (0)  = 
{U(t)(f):  0 < / < s((p)\. 

By  an  equilibrium  solution  of  ( 1 ) we  mean  an  element  0 e X such  that 
s(0)  = +og  and  y (0)  = {0}. 


Theorem  1.  Let  0 e X and  suppose  that  y (0)  is  bounded  with  respect 
to  ,0’.  Then  ,s(0)  = +x  and,  with  respect  to  |(",  the  solution  i/(0) 
has  a nonempty  compact  connected  invariant  r o- limit  set  oj(0).  Moreover, 
U(t)<j)  -» w(0)  as  t — > + x,  the  convergence  here  being  relative  to  <n. 


The  proof  occurs  in  two  main  steps.  First,  one  shows  that  y(0)  must  be 
precompact  relative  to  This  part  of  the  proof  very  much  involves  the 
smoothness  properties  of  solutions  of  (1).  Second,  one  uses  standard  argu- 
ments from  dynamical  systems  theory  to  obtain  the  existence  of  oj((p)  and 
its  required  properties. 

Now  we  introduce  a Liapunov  functional  V:  X -»  R by  setting 


T(0)  = 


n 

I 

• 0 


. <t>(x  I 

f{x,  t] ) dt] 


■ 0 


dx. 


0€X. 


One  can  show  that  for  any  0 e X, 


F(t/(f)0)  = - | u,(x,  t ; 0)2  dx,  0 < t < .s(0). 

• o 

This  result  together  with  the  LaSalle  invariance  principle  [12]  leads  us  to 
the  following  theorem. 
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Theorem  2.  If  0 and  w(0)  are  as  in  Theorem  I,  then  each  element 
0 e ro(0)  is  an  equilibrium  solution  of  Eqs.  (1). 

Theorem  2 can  be  used  to  obtain  stability  properties  for  equilibrium 
solutions  of  Eqs.  (1).  Suppose  that  0 e X is  such  an  equilibrium  solution. 
Without  loss  of  generality  we  can  assume  that  0 is  the  origin  in  X.  Thus, 

/(x,  0)  = 0,  0 < x < 7r.  (2) 

For  any  number  b > 0 we  let  Q[ 0,  b ] be  the  set  of  all  0 e X such  that 
0 < 0(. v)  < b for  every  x e [0,  rr]. 

Theorem  3.  Given  Eq.  (2),  suppose  that  there  exists  a number  b > 0 
such  that 

(i)  /(x,  c)  <0  for  all  x e [0,  7t]  and  c e (0,  b}\  and 

(ii)  for  any  <;  e [0,  b ] there  exists  an  x e [0,  7t]  such  that /(x,  <f)  < 0. 

Then,  for  each  0 e (?[0,  b]  we  have  .s(0)<+oo,  y(0)^Q[O,  h],  and 
(7(r)0||'"-»  0 as  t -*  + oo. 

On  the  basis  of  Theorem  3 one  can  readily  formulate  criteria  for  instability 
and  asymptotic  stability  of  the  equilibrium  solution  0=0.  In  the  case  of 
asymptotic  stability  one  also  obtains  a region  of  attraction  for  0 = 0 (see 
[5,  Section  5]). 

By  a nonconstant  equilibrium  solution  of  Eqs.  ( 1 ) we  mean  an  equilibrium 
solution  0 that  as  a function  on  [0,  7r]  is  nonconstant. 

We  close  this  chapter  with  an  interesting  instability  theorem  for  which 
we  must  assume  that  the  function /in  Eq.  (la)  does  not  depend  on  x.  That 
is,  the  domain  of  / is  R and  Eq.  (la)  has  the  form 

u,(x,  t)  = uxx(x,  t ) +/(u(x,  f)),  0 < x < 71,  0 < t < s. 

Theorem  4.  If,  in  the  sense  just  indicated. /does  not  depend  on  x and 
if  0 is  an  isolated  nonconstant  equilibrium  solution  of  Eqs.  (1),  then  0 is 
unstable. 
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In  this  chapter  we  indicate  how  a convergent  two-variable  procedure 
developed  by  the  authors  [2]  can  be  used  to  analyze  the  behavior  of  the 
autonomous  nonlinear  wave  equation 

Vi,  = y 2x  + d.vi  - y I3).  y2,  = ylx-  ( i ) 

As  we  shall  see,  the  interesting  qualitative  information  about  ( 1 ),  such  as  its 
asymptotic  behavior  and  the  existence  and  stability  of  periodic  solutions, 
is  conveniently  analyzed  in  terms  of  a certain  ordinary  differential  equation 
in  the  “slow”  time  z = i:t.  The  analysis  produces  quantitative  data  as  well,  so 
we  can  also  present  explicit  formulas  approximating  the  transient  behavior 
for  long  but  finite  times  and  the  form  of  the  infinitely  many  periodic 
solutions  to  (1 ). 

The  main  result  for  (1)  is  that  a solution  having  a 27r-periodic  initial 
condition  converges  with  increasing  time  to  a time-periodic  limit  function  of 
the  same  period.  The  limiting  value  is  approximated  by  combinations  of 
left-  and  right-traveling  waves.  When  viewed  in  the  moving  frame,  each 
appears  as  a bounded  measurable  (Lr  ) discontinuous  function  that  is  positive 
when  its  initial  value  is  positive,  negative  when  it  is  negative,  and  zero  at 
the  zeros  of  the  initial  point.  The  positive  and  negative  amplitudes  of  the 
wave  are  the  same  at  each  point  where  it  is  nonzero.  Its  magnitude 
depends  only  on  the  measure  of  the  set  of  zeros  of  the  initial  function,  but 
not  on  the  initial  amplitude.  An  initial  point  having  a set  of  zeros  of 
measure  zero  has  a limit  solution  that  is  exponentially  asymptotically  stable 
and  that  acts  as  a limit  cycle  since  it  attracts  to  it  any  other  initial  value 
with  the  same  general  shape. 

Equation  ( 1 ) arises  in  a variety  of  applications.  For  example,  suppose  elec- 
tromagnetic plane  waves  are  traveling  between  parallel  conducting  surfaces 
in  a region  where  the  conductivity  varies  with  the  electric  field  as 
( T0(ot  | E |2  — 1 ).  An  application  of  Maxwell’s  equations  and  a suitable  scaling 
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of  dependent  and  independent  variables  gives  (1)  along  with  the  boundary 
conditions 

yi(t,  0)  = yi(t,  7t)  = o.  (2) 

Another  example  is  a resistance-loaded,  battery-driven  transmission  line  in 
which  the  series  resistance  per  unit  length  is  zero  but  the  shunt  leakage  has 
a cubic  characteristic  approximating  that  of  an  Esaki  diode.  After  a suitable 
analysis  it  is  found  that  the  line  voltage  and  current  obey  (1)  and  (2). 
Finally,  there  is  an  equation  related  to  (1), 

z«  - zxx  = - z,3),  z(t,  0)  = z(t,  n)  — 0,  (3) 

which  arises  in  the  study  of  galloping  oscillations  of  a power  line  in  the 
wind.  The  standard  transformation  y,  = z,  and  y2  = zx  reduces  (3)  to  (1J 
and  (2). 

A few  authors  have  discussed  ( 1 ),  (3),  and  related  equations.  Most  of  these 
studies  are  formal,  such  as  the  papers  of  Chikwendu  and  Kevorkian  [1], 
Keller  and  Kogelman  [4],  and  Myerscough  [6],  However.  Kurzweil  [5]  has 
given  a rigorous  analysis  of  the  van  der  Pol  string 

z„  ~ zxx  = e(l  - z2)z„  z(t,  0)  = z(t,  1)  = 0,  (4) 

and  the  authors  have  studied  (I)  in  [3].  Recently  we  have  obtained  many 
new  results  about  this  equation  and  some  appear  here  for  the  first  time. 

The  analysis  of  ( 1 ) begins  after  it  is  transformed  to  a more  convenient  form. 
We  note  that  (1)  is  an  evolution  equation 

y = Ay  + eF(y),  (5) 

where  all  the  solutions  to  the  unperturbed  part  are  2^-periodic  in  / when  the 
initial  values  are  27r-periodic  in  x (the  D'Alembert  representation).  Let  {E(t)) 
be  the  group  of  operators  solving  y = Ay.  Put  y(t)  = E(t)u(t)  to  obtain  the 
equation  for  «, 

u = e/(t,  u)  = t:E(-t)F(E(t)u ),  (6) 

where  / is  a 27r-periodic  Lipschitzian  vector  field. 

Generally  we  anticipate  that  a solution  to  (6)  will  contain  oscillations 
at  the  rate  of  the  period  of  the  vector  field  with  an  amplitude  varying 
slowly  with  time.  To  pick  out  the  periodic  part  we  use  a device  due  to 
Cesari  and  Hale  and  replace  (6)  by  the  relation 

dw/drr  = w)  — Mf(w)\ , 


(V) 
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where  M is  the  mean  value  operator 

M/(w)=  ' | 2 /(.s,  vv(.s))  ds.  (8) 

Z7C  . o 

Since  the  vector  field  in  (7)  has  no  mean  value,  every  solution  w(r t,  v,  f) 
with  initial  value  v is  27t-periodic  in  a.  Of  course,  it  does  not  satisfy  (6). 
However,  we  can  show  that  for  each  v in  Tx , w(rr,  v,  f.)  and  its  Frechet 
derivative  in  v,  wv(<r,  v,  f.),  form  a-continuous  Lipschitzian  vector  fields 
themselves.  This  is  true  even  though  the  right-hand  side  of  (7)  is  not 
continuous  in  a on  L.,  . 

The  initial  value  v is  still  at  our  disposal.  Let  us  vary  it  so  that 

u(t,  e.)  = w(rr,  r(r),  e),  a = t,  t = Ft,  (9) 

will  satisfy  (6).  An  application  of  the  chain  rule  to  (9)  along  with  (7)  and  (6) 
shows  that  r must  satisfy 

wv{x/f,  v,  t.)  dv/dx  = Mf(w( •,  v,  /;)).  ( 10) 


From  the  representation  of  w as  an  integral  equation  (valid  when  e is  small) 

a 

w(cr,  v , /:)  = v + e I {/(.s,  w(s,  v,  t :))  - Mf(w( •,  v,  e))}  ds,  (11) 
• o 


it  can  be  deduced  that  w'  1 exists  and  is  bounded  and  Lipschitz  continuous 
in  v when  i:  is  small.  Thus,  (10)  can  be  solved  by  Picard’s  theorem  on  some 
interval  0 < r < a. 

Let  r(r,  t: ) satisfy  ( 10)  on  [0,  a].  Because  the  integrand  in  (11)  has  no  mean 
value,  u(t,  t:)  — r(et,  /;)  is  0(r.)  as  r.  ->  0 on  0 < t < a/r..  Furthermore,  if  v(c) 
is  a (stable)  equilibrium  point  to  (10),  then  w(t,  l(t),  f.)  is  a (stable)  2 n- 
periodic  solution  to  (6).  Using  the  continuity  of  v(x,  f.)  in  f.,  it  follows  that  the 
solutions  i(t)  to  (10)  when  f = 0. 


dv 

dx 


2 n 

J'(s,  v ) ds, 
o 


(12) 


are  also  asymptotic  to  the  solutions  to  (6).  Finally,  isolated  equilibrium 
points  v0  to  (12)  correspond  to  periodic  solutions  of  (6)  and  these  ,m 
exponentially  asymptotically  stable  if  the  same  is  true  for  r0 

Equation  (12)  is,  of  course,  the  classical  averaged  equation  ' 
Although  the  results  described  above  are  simply  the  usual  ^ 
of  the  method  of  averaging,  our  techniques  are  neither  gencia  u» 
applications  of  that  theory. 
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After  a tedious  calculation  we  obtain  the  specific  form  of  Eq.  (12): 

®i'  =i»i  ~ ^ 

n2'  = 2^2  - i{»23  + 3pj2p2  + 3m>2}, 

where  the  functional  a is  given  by 

«(t)  = 2n  I x)  + ”22(^  *)}  dx • (14) 

These  can  be  almost  uncoupled  by  letting  vx  = w,  + w2  and  v2  = w , — w2 
to  get 

w/  = (i  - 50R  - 

for  each  i = 1,  2.  Here  /i  has  the  same  form  as  a but  with  wx  and  w2 
replacing  vx  and  i?2  • 

If  the  values  of  w,  (0,  x)  = a,(x)  are  specified,  then  the  solution  to  (15) 
can  be  reduced  to  quadrature.  These  formulas,  which  are  too  long  to  give 
here,  describe  the  approximate  transient  behavior  for  times  of  order  1 for 
any  given  initial  value. 

By  letting  t -»  +oo  in  w,(t,  x),  the  equilibrium  points  for  (13)  can  be 
found.  These  are  simply  sums  and  differences  of  and  w2,  where 

w,(x)  = {2/(2  + 3A)},/2  sgn  a,(x).  (16) 

Here  A = (Ax  + A2)/2A  and  A,  is  the  measure  of  the  set  where  a,(x)  ^ 0. 
A direct  application  of  the  implicit  function  theorem  shows  that  each  w,(x) 
is  isolated  and  so  corresponds  to  a periodic  solution. 

The  variational  equation  for  (13)  is  quite  interesting.  Its  form  varies  with 
x.  At  those  x’s  where  a,(x)  = 0,  it  has  an  unstable  node.  Even  where 
a,(x)  / 0,  the  solution  is  unstable  if  A <2.  However,  if  A = 2 and 
a,(x)  / 0,  then  the  solution  decays  as  c_,/4.  Thus,  the  initial  points  whose 
sets  of  zeros  have  measure  zero  generate  exponentially  asymptotically 
stable  periodic  solutions.  Furthermore,  each  of  these  attracts  any  initial 
function  having  the  same  signum  value  as  the  limit  function  and  thus  acts 
as  a limit  cycle  by  drawing  to  it  large  classes  of  initial  points. 


REFERENCES 

[1]  S.  C.  Chikwendu  and  J.  Kevorkian,  A perturbation  method  for  hyperbolic  equations  with 
small  nonlinearities,  SIAM  J.  Appl.  Math.  22  (1972),  235-258. 

[2]  J.  P.  Fink,  W.  S.  Hall,  and  A.  R.  Hausrath,  A convergent  two-time  method  for  periodic 
differential  equations,  J.  Differential  Equations  15  (1974),  459-498. 


DISCONTINUOUS  PERIODIC  SOLUTIONS 


271 


[3]  J.  P.  Fink,  W.  S.  Hall,  and  A.  R.  Hausrath,  Discontinuous  periodic  solutions  for  a 
nonlinear,  autonomous  wave  equation,  Proc.  Royal  Irish  Acad,  (to  appear). 

[4]  J.  B.  Keller  and  S.  Kogelman,  Asymptotic  solutions  of  initial  value  problems  for  nonlinear 
partial  differential  equations,  SIAM  J.  Appl.  Math.  18  (1970),  748-758. 

[5]  J.  Kurzweil,  van  der  Pol  perturbations  of  the  equation  for  a vibrating  string, 
Czechoslovak  Math.  J.  17  (1967),  588-608. 

[6]  C.  J.  Myerscough,  The  growth  of  full  span  galloping  oscillations.  Central  Electricity 
Research  Laboratories,  Leatherhead,  Surrey,  England,  Lab.  notes  No.’s  RD/L/N271/73 
and  RD/L/N51/72. 


Continuous  Dependence  of  Forced  Oscillations 

for  U,  = V-y(|Vw|)Vw 

THOMAS  I.  SEIDMAN 
Department  of  Mathematics 

University  of  Maryland.  Baltimore  County.  Baltimore.  Maryland 


We  consider  the  autonomous  nonlinear  parabolic  equation  of  the  title 
with  forcing  provided  by  r-periodic  boundary  data.  Under  appropriate 
conditions  on  y(-)  there  is  a unique  r-periodic  weak  solution  and  we  are 
concerned  to  show  that  this  depends  continuously  on  the  data  and  on  the 
function  y.  The  equation  arises,  for  example,  in  the  analysis  of  induced 
currents  in  a nonlinearly  ferromagnetic  material  (Vu  is  the  field,  y the 
reluctivity). 

Let  Q be  a bounded  region  in  9?"  with  smooth  boundary  8CI.  With 
T : = 5K/Z  let  Q : = Q x T and  L : = SCI  x T.  Considering  functions  and  data 
as  defined  on  Q or  X “builds  in”  the  periodicity.  Thus,  rather  than 
seeking  periodicity  among  solutions  (e.g.,  by  applying  a fixed-point  theorem 
to  the  period  map  for  the  initial  value  problem),  we  look  for  solutions  in  a 
space  of  r-periodic  functions. 

It  is  convenient  to  impose  the  conditions  on  •/(■)  in  terms  of 
g(r):  = ry(r );  we  assume  g is  positive  and  differentiable  for  r > 0 and  that, 
for  some  p > 1, 

(i)  g(r)  = C (rp~  1 ) as  r —*  oo ; 

(ii)  m(r)  > 0 for  r > 0,  where  m(r):=  inf  {p(i) 2~pg'(p) : p >r). 

With  this  p,  let 


Y :=  {u  defined  on  Q : Vu  e Lp(Q  -*  9U)}, 

Y0 : = {i;  e Y : v satisfies  homogeneous  b.c.}. 

I 

We  assume  that  the  b.c.  are  such  that  T0  is  a Banach  space  with 


f IH' 


>/P 


(this  follows  for  Dirichlet  b.c.  by  the  Poincare  inequality;  for  Neumann  b.c., 


Preceding  page  blank 
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as  in  the  application,  it  is  necessary  to  impose  an  auxiliary  condition  on 
y0  such  as  jn  u = 0).  Let 


X:  = 


Y * 
1 0 


and  assume  the  boundary  data  ip  are  taken  from  a space  F (of  functions  on 
I)  for  which  the  extension  map  <pi— » <p  (<p  satisfying  the  b.c.  with  data  ip)  is 
continuous  from  F to  X.  Thus,  v : = u — <p  is  to  satisfy 


v,  = V • y(  | Vf  + V<p|)[Vt;  + V<p]  - (pt 


in  the  sense  that  v,  = — A(v  + (p)  — <pf,  with  A given  by 


A(u)w  : = y(|Vu|)Vu  • Vw,  ueY,  weY0. 

JQ 

From  the  conditions  on  y(  ) it  follows  that  A:  Y -*  Y0*  is  well  defined 
and  continuous. 


Lemma.  For  uu  u2e  Y such  that  (ul  — u2)e  Y0, 

[^(u,)  - A(u2)](u1  - u2)  > V(||U|  - u2||;  ||u2||), 

with 

T(r,  R):=  rpm(cr)Kp(r/R). 

Here  c :=  §[4  meas  (ft)]1/p  and  Kp(s)  is  independent  of  s for  p >2  and 
6(s2~p)  as  s-»0  for  1 < p < 2;  p , m(  ) provide  the  only  dependence  of 
T on  y(  ). 

Suppose  now  that  v°  = u°  — <p°  (with  v°  e Y0  and  <p°  e X)  satisfies 
v,°  = — A(v°  + (p°)  — (p,°  and  let  ve  Y0  satisfy  v,  = —A(v  + <p)  — (p,  for 
some  (p  e X.  Since  JQ  t;,  v = 0 for  differentiable  ve  Y,  one  has 

[A(v  + <p)  - A(v°  + <p)](v  - v°) 

= [/l(u°  + <p°)  - A(v°  + (p)](v  - v°)  + I (<p  - (p°),{v  - u°). 

• Q 

The  right-hand  side  goes  to  0 as  <p  -*  (p°  in  X whence,  by  the  lemma, 
v -»  v°  in  Y0 , so  u = v + <p~*  u°  in  Y. 

Similarly,  we  consider  y -»  y°  with  y,  y°  satisfying  the  conditions  uniformly 
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[i.e.,  fixed  p and  (9(rp  '),  m{  ) uniform  in  (i),  (ii),  respectively]  and  let  u, 
u°  satisfy  the  corresponding  equations  with  the  same  data.  Then 

[A(u)  - ,4(u0)](u  - u°)  = [A(u°)  - -4°(u°)](u  - u°)  < Cp\\u  - u°||, 

with  C depending  on  ||u°||  (but  not  on  y or  otherwise  on  y°,  </>°)  and 

P :=  sup  {(1  + r)l-p\g{r)  - g°(r)\ : r > 0}. 

Thus,  by  the  lemma,  u -*  u°  in  Y as  p -*  0 (y  -*  y°). 
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1.  Introduction 

Hydrodynamic  stability  is  concerned  with  the  evolution  of  laminar,  or 
streamline,  flow  to  the  turbulent,  or  eddying,  form.  Often  transition  to 
turbulence  entails  an  initial  bifurcation  from  a given  simple  laminar  flow 
to  a complicated  laminar  flow  of  wavelike  structure.  The  greatest,  albeit 
modest,  success  of  applied  mathematics  in  the  subject  lies  in  the  description 
of  this  relatively  simple  part  of  transition.  Even  after  many  decades  of  effort, 
we  have  no  complete  mathematical  or  physical  theory  of  transition  to 
turbulence.  For  this  reason,  this  chapter  is  concerned  with  transitions  or 
bifurcations  from  a basic  form  of  laminar  flow  to  a more  complicated, 
wavelike  form. 

Many  technical  and  detailed  mathematical  problems  have  been  studied. 
For  the  purposes  of  this  chapter,  we  shall  refer  to  three  such  examples, 
chosen  from  areas  that  we  have  found  to  be  of  special  interest.  We 
should  emphasize  that  we  are  concerned  with  formal  applied  mathematics: 
apart  from  just  a few  papers,  which  have  appeared  in  the  last  few  years,  the 
subject  has  not  been  developed  rigorously  in  the  truly  mathematical  sense. 
The  three  examples  of  this  chapter,  the  study  of  which  would  benefit  from 
greater  rigor,  concern  (i)  wavy  Taylor  vortices  in  the  concentric  case,  (ii) 
Taylor  vortices  in  the  eccentric  case,  and  (iii)wave  systems  in  plane  Poiseuille 
flow.  These  topics  are  treated  briefly  in  Sections  2-4,  in  a deliberately 
simplified  manner. 


2.  Wavy  Taylor  Vortices 

We  consider  two  long  concentric  circular  cylinders  of  radii  a (inner)  and 
h (outer),  the  speed  of  the  inner  being  with  the  outer  at  rest. 

2,7  Preceding  page  blank 
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Moreover  we  let  d = b - a,  and  denote  the  kinematic  viscosity  by  v.  It  is 
known  that,  if  the  parameter  Ta  = (qlalv)2(dla)i  exceeds  a critical  value  of 
order  1700,  the  first  bifurcation  point,  the  simple  laminar  circumferential  4 

flow  is  replaced  by  Taylor-vortex  flow,  which  is  periodic  along  the  axis  and 
is  composed  of  a steady  swirling  in  the  form  of  toroidal  vortices, 
superimposed  on  a circumferential  flow.  In  fact,  the  simple  laminar  flow 
becomes  unstable  and  bifurcates  to  the  Taylor-vortex  flow,  the  axial  wave- 
length being  given  experimentally  at  the  critical  value  of  the  Taylor  number. 

At  a still  higher  Taylor  number,  constituting  a second  bifurcation  point, 
the  Taylor-vortex  flow  itself  becomes  unstable  and  bifurcates  to  a stable 
wavy  vortex  flow,  in  which  the  Taylor  vortices  develop  a periodicity  and 
propagate  with  time  in  the  circumferential  direction. 

A quantitative  measure  of  the  flow  field  is  given  by  the  torque,  which  is  \ 

required  to  be  exerted  on  the  inner  cylinder  in  order  to  maintain  the  flow. 

In  simple  laminar  flow  the  torque  rises  with  Taylor  number.  Above  the 

first  bifurcation  point,  moreover,  the  stable  Taylor  vortex  flow  requires  a ; 

greater  torque  than  the  simple  laminar  flow,  which  is  now  unstable;  in 

addition,  the  torque  continues  to  rise  with  the  Taylor  number.  Above  the 

second  bifurcation  point,  however,  the  stable  wavy  vortex  flow  has  a lower 

torque  than  the  unstable  Taylor-vortex  flow  at  the  same  Taylor  number. 

The  study  of  the  instability  of  simple  laminar  flow  and  of  the  development 
of  Taylor-vortex  flow  above  the  first  bifurcation  point  has  made  considerable 
strides,  both  experimentally  and  theoretically,  and  these  have  been  sum- 
marized by  Stuart  [1].  Agreement  between  theory  and  experiment  is  good. 
Correspondingly,  the  problem  of  instability  of  Taylor-vortex  flow  has  been 
studied,  with  reasonable  agreement  with  experiment  as  to  the  location  of 
the  second  bifurcation  Taylor  number.  The  work  is  also  described  in  [1]. 

More  recently.  Eagles  [2]  has  calculated  the  torque  of  wavy  vortices  and 
shown  that,  subject  to  certain  assumptions,  the  calculated  wavy-vortex 
torque  is  lower  than  that  of  Taylor  vortices  at  the  same  Taylor  number,  in 
agreement  with  experiment. 

Problems  remain  to  be  solved:  for  example,  the  second  bifurcation  point, 
as  it  has  been  described  here,  is  in  truth  a cluster  of  bifurcation  points, 
with  different  circumferential  wave  numbers;  the  mechanism  of  choice  of 
the  circumferential  wave  number  is  not  known.  Moreover,  to  our  knowledge, 
there  is  no  mathematically  rigorous  treatment  of  the  second  bifurcation, 
which  involves  time  and  angular  dependence.  The  field  seems  ripe  for 
mathematical  treatment. 
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3.  Eccentric  Taylor  Vortices 

In  Section  2,  the  simple  laminar  flow,  whose  instability  is  first  treated, 
is  very  simple  indeed.  In  practice,  however,  more  complex  basic  laminar 
flows  arise,  for  example,  in  the  aerodynamical  and  geophysical  sciences. 
Usually  the  basic  flow  varies  significantly  in  the  flow  direction,  in  contrast 
to  the  simple  circumferential  flow  between  concentric  circular  cylinders 
discussed  previously.  An  example  of  this  type  occurs  in  the  journal  bearing 
of  lubrication  technology.  This  problem  may  be  idealized  as  the  flow 
between  two  circular  cylinders,  whose  axes  are  displaced  by  an  amount 
aeS , where  a,  b are  the  cylinder  radii  and  3 = (b  — a)/a.  The  quantity  <5, 
which  is  often  small  in  practice,  is  known  as  the  clearance  ratio,  while  e 
(0  < e < 1)  is  the  eccentricity.  The  fact  of  this  eccentric-cylinder  configura- 
tion having  a load-bearing  capability  is  dependent  on  e being  nonzero. 

There  is  one  main  way  in  which  the  instability  problem  of  the  basic  flow, 
or  first  bifurcation,  differs  from  that  for  the  concentric  case:  with  the 
assumption  of  sinusoidal  periodicity  along  the  axis  and  of  exponential 
dependence  on  time,  the  “concentric”  differential  equations,  which  yield  the 
critical  Taylor  number  through  an  eigenvalue  problem,  are  ordinary.  In 
contrast,  because  the  basic  flow  in  the  eccentric  case  varies  in  the  circum- 
ferential (or  flow)  direction  and,  furthermore,  depends  on  that  circumferential 
angle,  the  differential  equations,  which  yield  the  critical  Taylor  number,  are 
partial  with  radial  and  circumferential  derivatives. 

From  an  experimental  point  of  view,  there  are  several  ways  in  which  the 
eccentricity  manifests  itself  in  the  development  of  the  Taylor  vortices.  In 
many  experiments,  but  not  all,  the  initial  Taylor  number  rises  monotonically 
with  eccentricity,  as  illustrated  in  the  data  summarized  by  DiPrima  and 
Stuart  [3].  Second,  there  is  a particular  experiment,  due  to  Vohr  [4], 
which  indicates  that  the  position  of  maximum  vortex  strength  occurs  some 
50°  downstream  from  the  maximum  gap.  Third,  Vohr  (4)  shows  that  the 
angle  at  which  the  torque  bifurcates  at  the  critical  Taylor  number  shows 
significant  dependence  on  e. 

A formal  perturbation  theory,  which  is  assumed  valid  for  small  e and 
utilizes  the  specified  relation  31'2  = ke,  where  k is  a positive  constant,  has 
been  developed  in  [3].  The  authors  find  that  the  critical  Taylor  number 
does  rise  monotonically  with  e,  and  agreement  with  experiment  is  quite  good. 
This  arises  from  a linear  calculation.  As  a byproduct  it  is  found  that 
maximum  vortex  strength  lies  at  90°  downstream  from  maximum  gap.  When 
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the  perturbation  scheme  is  extended  to  include  nonlinear  effects,  as  DiPrima 
and  Stuart  [5]  have  done,  the  choice  of  parametric  values  appropriate  to 
experiment  does  yield  a position  of  maximum  Taylor-vortex  strength  more 
nearly  at  50°.  This  effect  of  nonlinearity  is  made  evident  through  the 
structure  of  the  first  term  in  the  expansion  of  the  solution  of  the  nonlinear 
partial  differential  equation  for  the  Taylor-vortex  flow,  in  the  form  of  a 
function  of  a radial  variable  times  a function  B((p),  where  ^ is  a circum- 
ferential angle.  This  function  B(4>)  is  found  to  satisfy 

k dB((f))/d4>  = yB((f>)  cos  </>  + - y2  B3(4>), 

and  arises  essentially  from  the  method  of  multiple  scales;  y,  y,,  and  y2  are 
positive  numbers;  k has  been  defined  earlier,  while  the  positive  number  7i 
dictates  the  amount  by  which  the  Taylor  number  exceeds  its  critical  value. 
A choice  of  k and  Tx  appropriate  to  experiment  yields  the  50°  mentioned. 
Caution  is  necessary,  however,  because  of  the  experimental  magnitude  of  e, 
which  may  be  too  large  for  application  of  the  theory.  Theoretical  results  for 
torque  are  not  yet  available. 


4.  Wave  Systems  in  Shear  Flows 

In  the  problems  of  Taylor  vortices  in  Sections  2 and  3,  attention  has 
been  focused  on  bifurcations  associated  with  the  development  of  a new 
stable  solution  associated  with  a particular  wave  number  or  pair  of  wave 
numbers.  In  many  boundary-layer  situations,  however,  it  is  found  that 
streamwise-propagating  waves,  arising  from  instability  of  a basic  laminar 
flow,  have  energy  spread  over  a spectrum  of  wave  numbers.  This  wave  system 
evolves,  in  a way  that  is  imperfectly  understood,  to  produce  turbulence.  A 
theoretical  example  of  this  situation,  but  one  that  has  experimental 
difficulties,  occurs  in  plane  Poiseuille  flow,  which  occurs  when  fluid  is 
driven  under  pressure  between  two  parallel  planes.  We  now  concentrate 
briefly  on  that  case. 

An  essential  parameter  is  the  Reynolds  number  R = U0  h/v,  here  U0  is  the 
maximum  value  of  the  velocity  distribution,  which  is  parabolic  in  the 
coordinate  between  the  two  parallel  planes,  2 h the  distance  between  them, 
and  v the  kinematic  viscosity.  A velocity  perturbation  mode  of  the  form 
exp  [/a(x  — cr )],  where  x is  downstream  distance,  t time,  and  a and  c 
constants,  is  exponentially  amplified  in  x and/or  t if  R is  greater  than  a 
critical  value  Rc.  This  occurs  at  a particular  value  of  a,  namely,  ac.  For 
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larger  Reynolds  numbers  there  is  a band  of  values  of  a for  which  instability 
takes  place.  We  wish  to  discuss  how  a wave  system  can  evolve  with  energy 
spread  over  this  band  of  wave  numbers.  A number  of  authors  have 
independently  studied  problems  of  this  type  with  application  to  traveling 
waves  in  shear  flows,  including  DiPrima  et  al.  [6],  Newell  and  Whitehead  [7], 
and  Stewartson  and  Stuart  [8].  The  latter  especially  treated  plane  Poiseuille 
flow  directly  by  formal  expansion  in  terms  of  a parameter  e proportional  to 
( R - Rc).  By  the  method  of  multiple  scales,  the  first  term  in  the  assumed 
series  for  the  perturbation  velocity  is  found  to  be  proportional  to 
£1/2A(£,  t)  exp  [iac(x  - cf)],  where  £ = ei,2(x  — Cgf),  t = et,  Cg  is  the  group 
velocity,  and  A{£,  t)  satisfies 

f]  A r]  ^ A 

- - a2  2 = (l  + ia>)A  + kA\A\2, 

a2  and  k being  complex  numbers  and  a>  real. 

There  are  two  features  of  the  above  amplitude  equation  to  which  we 
draw  attention:  (i)  near  Rc  and  ac,  the  real  part  of  k is  positive,  so  that  the 
bifurcation  is  subcritical,  leading  to  unstable  modes  with  R < Rc  provided 
their  amplitude  is  large  enough,  as  described  in  [1]  and  by  Chen  and 
Joseph  [9];  (ii)  a phenomenon  of  a self-focusing  “burst”  to  larger  amplitudes 
can  occur,  for  which  reference  is  made  to  Davey  et  al.  [10]  and  papers  cited 
there. 
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1 

The  concept  of  normality  for  a control  process  represented  by  an  ordinary 
differential  equation 


dx/dt  — A{t)x  = B(t)u(t ) (U) 

was  introduced  by  J.  P.  LaSalle  in  order  to  ensure  uniqueness  of  time 
optimal  controls.  The  meaning  and  the  applicability  of  such  notions  are 
analyzed  here  in  a broader  context  than  that  of  LaSalle. 

The  symbols  used  in  (U),  as  usual,  represent  t,  a real  variable  in  a 
compact  interval  / = [t,  T];  x = x(t),  a real  n-vector;  u(t),  a real  m-vector; 
A(t)  and  B(t),  real  matrices  of  types  n x n,  n x m,  respectively.  The  functions 
t -*  A(t)  and  t -*  B(t)u(t)  are  (Lebesgue)  measurable  and  integrable  in  /. 
We  shall  further  denote  by  H(t ) the  n x m matrix 

H(t)  = X(T)X~l(t)B{t), 

where  X is  any  fundamental  matrix  solution  of  the  linear  equation 

dx/dt  — A(t)x  = 0. 

According  to  LaSalle  [1,  p.  65]  the  control  process  (U)  is  said  to  be  a 
normal  one  iff 

y*  0,  j = 1,  m=>y*hj(t)  / 0,  a.e.  tel,  (1.1) 

where  y e En,  y * is  the  transpose  of  y,  and  h}{t)  is  the  y'th  column  of  H(t). 
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Relation  (1.1)  is  equivalent  to  the  following:  for  each  y # 0 there  is  a 
unique  u such  that 

y*  | H(t)u(t)  di  = sup  y*  | H(t)w(t)  dt,  (12) 

• / we  Bx  r -l 

weflXiX,  (1.3) 

where  flx  x denotes  the  unit  ball  of  the  Banach  space  U (/,  lm‘ ). 

The  normality  condition  ensures  that  for  every  v e En  belonging  to  the 
domain  of  zero  controllability  (i.e.,  to  »he  set  of  initial  states  v that  can  be 
transferred  to  zero  in  a finite  time),  corresponding  to  the  set  of  controls 
C — By  x , there  is  a unique  time  optimal  control  steering  the  state  to  zero. 


2 

The  concept  of  normality  can  be  defined  for  a class  of  control  sets  C more 
general  than  Bx  x . In  [2]  we  have  considered  the  case  C = Bp  , ■ , the 
unit  ball  of  the  Banach  space  If (/,  fm),  with  1 < p'  < x,  1 < r < x.  In  that 
case  (U)  is  said  to  be  normal  for  Sp  -r-  iff  there  is  a unique  solution  u of 
(1.2)  and  (1.3)  with  Bp.  r.  replacing  £x  x.  Then  it  can  be  proved  that 
(if  m > 2)  there  are  six  different  types  of  normal  processes,  which  can  be 
characterized  in  terms  of  A (via  X)  and  B. 

If  we  denote  by  Np  r.  the  property  of  (U)  being  normal  for  Bp  r we 
have  the  following  implications: 

Np\  X =>  Np-,  r‘  ^ Np',  1 

ft  ft  ft 

^x.x  ^^x.r'^  Wx.  1 

Here  Nx>x  corresponds  to  LaSalle’s  original  definition  and  Np • r. , 

1 < p'  < oo,  1 < r'  < oc,  to  the  complete  controllability  of  (U). 

When  A and  B are  independent  of  f,  then  there  are  only  three  different 
types,  since  Np.  r.  = Nx  r.  for  1 < r'  < oo,  and  they  can  be  characterized 
directly  in  terms  of  A and  B. 


3 

Let  us  now  consider  normality  in  a more  abstract  setting,  as  an  attempt 
to  extend  it  to  infinite-dimensional  control  processes. 
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We  are  given  two  Banach  spaces  U and  X , a linear  continuous  map 
L:  (J  -*  X,  and  a convex  set  Cc(/,  not  reduced  to  a single  point.  For  each 
x'  e X'  (the  normed  dual  of  X),  the  set  of  u such  that 

x'(Lu)  = sup  x'(Lw),  (3.1) 

Vi  6 C 

ueC,  (3.2) 

is  convex  (possibly  empty).  Then  we  say  that  L is  normal  for  C iff  for 
each  x'  e X',  x'  / o,  there  is  at  most  one  u satisfying  (3.1)  and  (3.2). 

A comparison  with  Section  2 gives  U = If' {I,  Fm),  X = L defined  by 
Lu  = H(t)u(t)  dt,  and  C = Bp  r. . 

With  reference  to  [1,  p.65]  we  also  say  that  L is  essentially  normal  for 
C iff  the  set  LC  is  strictly  convex.  This  means  that  every  closed  support  plane 
to  LC  meets  LC  at  just  one  point. 

It  is  easy  to  prove 

Theorem  t.  If  L is  normal  for  C and  LC  is  closed  then  L is  also 
essentially  normal  for  C. 

L can  be  said  to  be  completely  controllable  by  means  of  U iff 

LU=X, 

i.e.,  iff  the  image  LU  of  U is  a dense  subspace  of  X.  Then  we  have 

Theorem  2.  If  L is  normal  for  some  C cz  U then  L is  completely 
controllable  by  means  of  U.  Conversely  if  L is  completely  controllable  by 
means  of  U , then  L is  normal  for  any  C c U that  is  strictly  convex. 


4 

As  we  observed  at  the  beginning,  normality  was  introduced  as  a condition 
to  ensure  the  uniqueness  of  time  optimal  controls.  In  fact,  if  u e flXi  x is  a 
time  optimal  control,  then  Lu  belongs  to  the  boundary  of  the  attainable  set 
LBr  x;  hence  by  the  supporting  property  of  closed  convex  sets  in  E"  there 
must  exist  some  y ^ 0 satisfying  (1.2). 

The  same  holds  for  the  case  C = Bp  r- . 

Unfortunately  this  is  no  longer  true  for  (3.1)  in  the  general  case,  since 
a convex  set  LC  of  an  infinite-dimensional  space  X does  not  necessarily  have 
the  supporting  property.  In  other  words,  the  notion  of  normal  control 
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process  seems  to  be  confined  to  the  case  of  processes  for  which  a maximum 
principle  holds  [3]. 
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1.  Introduction 

Here  we  consider  a general  projection  method  for  obtaining  approxima- 
tions to  a hereditary  system.  We  shall  define  the  system  and  present  the 
approximating  systems  in  this  section.  The  main  results  are  stated  in 
Section  2. 

The  space  of  f?"-valued  square  integrable  functions  defined  on  an  interval 
[a,  b]  will  be  denoted  by  L2(a , b).  Let  r > 0 be  a fixed  real  number.  If 
x:[  — r,  +oo )-►/?"  is  given,  then  xr:[  — r,  0]  -*  Rn  is  defined  by  x,(.s)  = 
x(t  + s),  s e [ — r,  0],  for  each  t > 0. 

We  assume  that  L is  a linear  transformation  with  domain  in  the  linear 
space  of  Rn- valued  Lebesgue  measurable  functions  defined  on  [ — r,  0],  such 
that  L restricted  to  the  space  of  continuous  functions  is  a bounded  operator. 
As  in  [2],  L is  required  to  satisfy  the  following  conditions: 

(C)  If  r,  > 0 and  x e L2(-  r,  tY ),  then  the  function  g(t)  = L(x,)  is  defined 

almost  everywhere  on  [0,  f j],  and  depends  only  on  the  equivalence  class 

of  the  function  x.  Also,  g belongs  to  L,(0,  fj)  and  there  is  a continuous 

function  T such  that 

| |L(x,)|  ds  < T(f)||  |x(s)|2r/.sj  , for  all  r e [0,  rj. 

The  system  is  defined  by  the  linear  retarded  functional  differential 
equation 

x(t)  = L(x,)  + /(r),  t>0,  (1.1) 
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and  the  initial  data 

x(0)  = tj,  xo=0,  (1.2) 

where  rj  e Rn,  0 e L2(-r , 0),  and  fe  L2( 0,  f,)  for  every  f,  > 0. 

A solution  is  a function  x e L2(  — r,  ),  such  that  x is  absolutely  continuous 
(a.c.)  for  t > 0,  x satisfies  (1.1)  a.e.  on  [0,  fj,  x(0)  = rj,  and  x(s)  = 0(s ) 
a.e.  on  [ — r,  0].  In  [2]  it  is  shown  that  there  is  a unique  solution  to  (1.1) 
and  (1.2)  defined  on  [ — r,  oo),  and  this  solution  depends  continuously  on 
initial  data  (rj,  0)  e Rn  x L2(  — r,  0).  Let  Z denote  the  product  space 
Rn  x L2(  — r,  0)  and  for  r > 0 define  S(t):  Z-*Z  by  S(t)(rj , 0)  = (x(f),  x,), 
where  x is  the  unique  solution  to  system  (1.1)  and  (1.2)  with  f = 0.  The 
following  lemma  may  be  found  in  [2]. 

Lemma  1.1.  The  semigroup  S(t ) is  strongly  continuous  and  has  a closed, 
densely  defined  infinitesimal  generator.  If  ,c/  denotes  this  generator,  then: 

(i)  The  domain  of.c/  is  given  by  ^(.c/)  = {(rj,  0):  0 is  a.c.,0  e L2(-r,  0), 
and  t]  — 0(0)}. 

(ii)  If  ( t ],  0)  e £/(.</),  then  .c S(rj,  0)  = (L(0),  0). 

(iii)  There  exist  constants  y and  M such  that  the  spectrum  of  d lies 
to  the  left  of  the  line  Re  A < y,  and  |S(t)(r/,  0)|  < Me(y+I)l\(r},  0)|  for  each 
e > 0. 

In  order  to  define  the  approximating  systems  we  make  the  following 
assumption: 

(A)  There  exists  a sequence  of  finite-dimensional  subspaces  of  Z, 
denoted  by  ZN.  a sequence  of  projections  P" : Z -*  ZN,  and  a sequence  of 
linear  operators  .c/N:  ZN  ->  ZN  such  that: 

(i)  |exp  tfNt | is  uniformly  exponentially  bounded, 

(it)  | PN($.  0)  - (^,  0)|-O  for  all  ({,  0)e  Z, 

(iii)  I .tfNPN(Z,  0)  - V(«,  0)1  -0  for  all  ({,  0)  e &(.v/). 

Note  that  condition  (A)  implies  that  .c/v  "converges”  to  .</  in  the  sense 
of  Trotter  (see  [3,  6].)  Therefore,  we  introduce  the  approximating  system 
defined  by  the  ordinary  vector  differential  equation 

vv(r)  = .c/\v(0  + Py(f(t),  0),  t > 0,  (1.3) 

and  the  initial  value 

w(0)  = Ps(n,  0), 

where  0 denotes  the  zero  function  in  L2(  — r , 0). 


(1.4) 
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Let 

z*(r;/)  = exp  (.c /"t)PN(ri,  4>)  + | exp  [s/N{t  - o)]PN(f(o),  0)  do 

• o 

denote  the  solution  to  system  (1.3)  and  (1.4),  and  define  z(t;  f)  by 

= S(t)(ri , <t>)  +|  S(t  - o){J(o),  0 ) do. 

• o 


2.  Statement  of  Results 

We  now  present  our  main  results  concerning  the  representation  of 
solutions  of  (1.1)  and  (1.2),  and  convergence  of  approximating  solutions. 
Proofs  of  these  statements,  along  with  examples  and  an  application  to 
optimal  control  theory,  will  appear  in  [1], 

The  following  theorem  extends  the  results  stated  in  [2],  and  provides  an 
important  representation  theorem. 

Theorem  2.1.  Let  r,  >0,  ( r /,  (/»)  e f/(.c/),  and  JeL2( 0,  f,).  If  x(-;  /) 
denotes  the  solution  to  (1.1)  and  (1.2),  then  z(t;f)  = (x(t;f),  x,(  ;f)). 

If  assumption  (A)  is  made,  then  one  may  conclude  from  Trotter’s  theorem 
[3,  6]  that 

|exp  (.c /Nt)P"(Z,  ip)  - Sm , *A)|  ->0  for  all  ({,  iP)  e Z,  t > 0. 
Using  this  result,  we  can  establish  the  following. 

Theorem  2.2.  Let  f,  > 0 and  t e [0,  fj.  If  assumption  (A)  holds,  then: 

(i)  The  operator  F(t):  L2( 0,  t{)~*  Z defined  by 

F(t)J=  \ ‘s(t-o)(f(o)J))do 
• o 

is  a compact  linear  operator; 

(ii)  \zs(t;  J)  - z(t,  f)\->0  as  iV-»x,  uniformly  in  / for  /in  bounded 
subsets  of  L2( 0,  /,). 

Corollary  2.1.  If  assumption  (A)  holds  and  fK  converges  weakly  to  J 
in  L2( 0,  f,),  then  for  each  t,  zN(t',fK)->  z(t;f)  strongly  as  N,  K -*  x. 
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Corollary  2.2.  If  assumption  (A)  holds  and  ■? : L2(  0,  fj  -» ^([0,  f 2],  Z) 
is  defined  by  &[f]{t)  = then  is  a compact  affine  operator.  Here 

% is  the  space  of  Z-valued  functions,  continuous  on  [0,  fj,  taken  with  the 
supremum  norm. 

We  emphasize  that  the  approximating  system  is  largely  determined  by 
the  selection  of  the  spaces  ZN  and  the  projections  PN.  For  example,  by  an 
appropriate  choice  of  ZN  and  PN  we  can  obtain  a widely  used  finite- 
difference  scheme  for  approximating  solutions  to  (1.1)  and  (1.2).  Con- 
sequently, results  such  as  those  found  in  [4]  become  special  cases  of  the 
above  theorems. 

The  importance  of  Corollary  2.1  in  establishing  approximation  results 
for  optimal  control  problems  should  be  obvious.  A more  detailed  discussion 
of  this  aspect  of  these  results  will  be  given  in  [1].  An  application  of  the 
Trotter  theorem  to  the  problem  of  finding  approximate  optimal  controls 
for  another  type  of  infinite-dimensional  system  may  be  found  in  [5]. 
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1.  Statement  of  the  Problem  and  Basic  Results 


In  connection  with  optimal  heating  of  solids,  the  following  parabolic 
control  problem  arises  that  has  been  previously  treated  in  [1,  8].  We  are 
looking  for  a measurable  function  u on  a given  finite  interval  [0,  T],  T > 0, 
such  that  under  the  conditions 


|u(r)|  < 1 for  almost  every  t e [0,  T], 
y,(x,  t)  = yXJC(x,  t)  for  0 < t < T,  0 < x < 1, 


yJfK  t ) = o, 

ML  0 + y*(L  0 = bu(t) 
y(x,  0)  = 0 for 


for  t e (0,  T], 
x e [0,  1], 


the  quantity 

lb(-,  T)-  yT\\  = sup  |y(x,  r)-yT(x)| 

X € [0 , 1) 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 


is  to  be  minimized,  where  yT  is  a given  function  in  a suitable  function 
space  F[ 0,  1]  and  b is  a given  positive  constant. 

Yegorov  [8]  has  shown  that  for  each  ue  L°°[0,  T]  there  is  exactly  one 
generalized  solution  y = y(u,  x,  t)  of  (2)— (5)  such  that  for  y(u,  •,  T)  the 
representation  formula 

y(u,x,  T)  = f c°s/ikx  I exp[-nk2(T  - t)]u(t)dt  (7) 

‘0 

holds,  where  the  nk's  form  the  sequence  of  positive  solutions  of  the 
transcendental  equation  n tan  n = b and  the  >4k’s  are  given  by 


-4*  ~ 


2 sin  nk 

Hk  + sin  nk  cos  nk' 


k = l,  2, 
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Using  (7)  it  can  be  shown  that  by  defining  I 

(Su)(x)  = y(u,  x,  T),  x 6 [0,  1],  (8)  ' 

we  obtain  a linear  mapping  S:  £°[0,  T]->C[0,  1]  that  is  continuous  and  J 

compact.  Therefore,  it  is  sensible  to  choose  F[0,  1]  = C[ 0,  1].  - 

If  we  put 

Q = {ueL*[ 0,  T] : |u| < 1 a.e.},  (9) 

we  can  define  the  extreme  value  of  the  above  control  problem  by  i 

p(yT,Q)=  inf  ||Sm  — (10)  ! 

ueQ 

where  ||-||  denotes  the  maximum  norm  on  C[0,  1].  i 

An  optimal  control  u e Q is  then  defined  by  \ 

\\Su- yT\\  = p(yT,Q)-  ' 00  j 

Yegorov  [8]  has  further  shown  that  the  existence  of  an  optimal  control  j 

u e Q is  ensured.  By  recent  results  of  Week  [5]  the  additional  condition 

p(yr,  Q)>  0 implies  that  there  is  exactly  one  optimal  control  ue  Q that,  \ 

furthermore,  satisfies  the  “bang-bang”  condition  |u|  = 1 a.e.  on  [0,  Tj. 

The  purpose  of  this  chapter  is  to  indicate  how  lower  bounds  for  the 
extreme  value  (10)  of  the  control  problem  can  be  obtained  that  yield 
arbitrarily  close  estimates.  The  results  are  given  without  proofs,  which  can 
be  found  in  [2,  4]. 

Similar  approaches  for  obtaining  lower  bounds  of  the  extreme  value  of  : 

control  problems  with  distributed  parameters  have  been  given  by  Yavin 
[6,  7], 


2.  Lower  Bounds  for  the  Extreme  Value 
via  Dualizing  the  Problem 

2.1.  THE  DIRECT  APPROACH 

As  in  [4]  we  start  with  the  well-known  duality  formula 

p(yT,  Q)  = max  !y*(yr)  - sup  y*(Su)j  (12) 

y*  e 0*  I ue  Q I 

(see,  e.g.,  [3,  p.  62]),  where  S:  £*  [0,  T]  ->  Y = C[0,  1]  is  the  linear  mapping 
(8)  and  B * denotes  the  unit  ball  of  the  dual  space  Y * of  Y.  On  using  (7) 
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and  introducing  the  adjoint  mapping  S *:  Y* ->  L?[ 0,  T]*  one  can  verify 
for  every  u e £*[0,  T]  and  y*  e Y*  that 

(S*y*)(u)  = f T(SV)(t)u(<)  dl, 

• 0 

where 

(S*y*)(t)=  f Akpk2y*(cospk)c\p[-pk2(T-t)]  (13) 

k = 1 

is  in  l}[ 0,  T].  This  implies 

sup  y*(Su)  = ||S*y||  t,  (14) 

u e Q 

and  hence 

p(yr , Q)  = max  {y*(yT)  - ||  S*y* ||  t },  (15) 

y*  € B" 

where  ||  ||,  denotes  the  L‘-norm. 

Equation  (14)  also  holds  if  /^[O,  T ] is  replaced  by  C[0,  T],  which 
means  that  only  continuous  control  functions  are  admitted.  The  extreme 
value  (10)  then  is  not  changed;  however,  the  existence  of  optimal  controls 
is  in  general  no  more  ensured. 

If  y * e B*  is  a positive  linear  functional,  then  due  to  the  monotonicity  of 
the  mapping  (8),  it  follows  that  S*y*  given  by  (13)  is  a nonnegative 
function  in  L‘[0,  T]  thus  yielding  the  estimation 

y*M  ~ I AkY*( cos  /v)[l  “ exp  (~pk2T)]  < p(yT,  Q).  (16) 

k = 1 

2.2.  THE  INDIRECT  APPROACH 

Let  / be  the  set  of  all  the  continuous  linear  functionals  on  Y = C[0,  1] 
of  the  form 

i 

y*(y)=f  y*(*M*)  >,ey> 

Jo 

with  j,6y  and  ft  \ym{x)\dx  < 1. 

Then  by  [4],  we  have  for  the  extreme  value  (10) 
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where  again  S*y*  is  given  by  (13).  For  each  y*  e I the  function  S*y* 
can  be  computed  in  the  following  way:  Let  z = z(x,  t)  be  the  “classical 
solution”  of  the  “backward  initial  boundary  value  problem” 


(^,  0 0’ 

(x,  f)e  (0,  1)  x [0,  T), 

(18) 

Zx( 0,  t)  = 0 

Mi,  0 + Mi,  0 = o 

for  t e [0,  T), 

(19) 

z(x,  T)  = y*(x). 

xe  [0,  1], 

(20) 

By  applying  partial  integration  to 

T , 1 

I I [z,(x,  t)  + zxx(x,  f)]y(u,  x,  t)  dxdt  = 0 
- 0 * 0 


for  each  u e C[0,  T],  where  y(u,  •,  •)  is  the  corresponding  “classical” 
solution  of  (2)-(5),  it  can  be  easily  shown  that 


(SY)(«)  = f T(S*y*)(t)u(t)  dt  = b\'z(l,  t)u(t)  dt. 

J o J * 


(21) 


Defining  Z as  the  set  of  all  the  solutions  of  (18)-(20)  such  that 
z e C([0,  1]  x [0,  T]),  z,zxxeC(( 0,  1)  x [0,  T)), 

i 

zx(l,t)=  lim  zx(x,  t),  te  [0,7’),  and  ! \z(x,  T)\ dx  < l, 

x—1-0  -0 

we  conclude,  by  virtue  of  (17)  and  (21)  and  the  remark  following  Eq.  (15), 
that 


p(yT’  Q)  35  sup  J I z(x,  T)yT(x)  dx-b  | |z(l,  t)|  dt\.  (22) 

i-z  ro  Jo  I 

Taking,  for  instance, 


N 

z(x,  t)  = £ck  exp  (nk2t)  cos  nkx. 


k=  1 


we  have  that  ze  Z if  the  Ck's  are  chosen  such  that 
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Hence 


N . * 

£ Ck  exp  (nk2T)  | yT(x)  cos  pk  x dx 

k=  1 ‘0 


- b 


X Ck  exp  (nk2t)  cos  pk 


dt  < p(yT , Q)- 


*=  i 


(24) 


Furthermore,  it  can  be  shown  that  by  varying  N and  Ct,  CN  such  that 
(23)  holds,  the  supremum  of  the  left-hand  side  of  (24)  equals  p(yT,  Q). 


2.3.  CONCLUDING  REMARK 

By  using  the  duality  formula  (12)  it  can  be  shown  that  if  the  extreme 
value  (10)  of  the  control  problem  is  positive,  then  the  unique  optimal 
control  u can  be  characterized  in  terms  of  a more  refined  bang-bang 
principle  as  follows:  For  each  6 e (0,  T)  we  have  \u  \ = 1 on  [0,  T — S]  with 
the  exception  of  a finite  number  of  points,  i.e.,  the  jumps  of  it  are 
enumerable  and  can  only  accumulate  at  t = T (for  details,  see  [2]). 
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We  consider  systems  typified  by 

x(t)  = A_ix(t  - h)  + A0x(t)  + Axx(t  - h)  + Bu(t),  (1) 

where  h > 0,  the  A(  are  n x n constant  real  matrices,  and  B is  an  n x m 
constant  real  matrix.  The  control  vector  u takes  values  in  Rm  and  is 
assumed  to  be  in  L2  on  compact  intervals.  The  natural  state  space  of  x for 
such  systems  is  the  Sobolev  space  W(2 '([  - h,  0],  R")  of  absolutely  continuous 
functions  tf:  [-/i,  0]  -»  Rn  with  derivative  0'  in  L2. 

Considering  the  initial  time  t as  0,  we  denote  the  solution  x of  (1)  on 
t > 0 such  that  x0  = </>  by  x(\  </>,  u).  Here  xt  is  defined  by  xT((J)  = x(z  + Of 
0e[-h,  0j.  We  say  that  system  (1)  is  controllable  on  [0,  t]  iff  for  all 
ip  e W(2'  ’([  — /?")  there  exists  u e L2([0,  r],  Km)such  that  xt(  -,  </;,  u)  = i (/. 

Of  particular  interest  is  the  attainable  set  from  0 defined  by 

.c/(t)  = {(/>  = xr(-,  0,  u)\u  e L2([0,  t],  Rm)j. 

Obviously,  .c/(t)c:  fT2  ’([ - h,  0],  R")  and  it  is  easy  to  see  that  (1)  is 
controllable  on  [0,  r]  iff  x/(r)  = fT2 ’([  — /?,  0],  R").  Moreover,  control- 
lability of  (1)  on  [0,  r]  implies  t > h. 

For  any  integer  v > 1,  define  the  controllability  matrix  Cv  = [#,  A _ {B — , 
A'Sf  B].  One  can  prove  the  following: 

Proposition.  If  (1)  is  controllable  on  [0,  r]  and  p > I is  the  integer 
such  that  ph  < t < (p  + l)/i,  then  rank  Cp  = n ; in  particular,  if  r > nh,  then 
rank  C„  = n. 
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The  solutions  x(\  0,  u)  of  (1)  are  given  by  the  variation  of  constants 
formula 

t 

x(t , 0,  u)  = x(t,  0,  0)  + | <1>((  — s)Bu(.s)  ds,  t > 0, 

• 0 

where  O(t)  is  the  n x n transition  matrix  corresponding  to  (1).  The  system 
(1)  is  Euclidean  controllable  on  an  interval  [0,  a]  if  {0(-/2)|0  e x/(<r)}  = Rn. 
A necessary  and  sufficient  condition  for  (1)  to  be  Euclidean  controllable  on 
[0,  a]  is  that  the  controllability  Gramian 

a 

G(a)  = [ 0(s)BB*<t>*(s)  ds 
• o 

have  rank  n.  One  may  show  that  rank  C„  = n implies  rank  G(x  — h)  = n if 
t > nh,  but  not  if  r < nh.  Examples  show  that  rank  C„  = n is  not  by  itself 
a sufficient  condition  for  ( 1 ) to  be  controllable  on  [0,  t],  t > nh  (cf.  Example  3 
below).  However,  the  condition  is  sufficient  to  establish  a result  that  can 
often  be  substituted  for  controllability  (cf.  [2]).  That  is,  rank  C„  = n implies 
that  .cZ(t)1  has  finite  dimension  and  x/(x)  is  closed  in  W2  >([  — h,  0],  Rn ),  or 
what  is  the  same,  the  solution  operator  ui-»  xT(-,  0,  u)  has  closed  range  and 
finite  deficiency  in  0],  R")  (cf.  [4]).  Indeed,  this  property  persists 

for  quite  general  neutral  functional  differential  equations  of  the  form 

d &>(xt)  = L(x,)  + Bu(t),  (2) 

dt 

considered  by  Hale  and  Meyer  [3].  In  Eq.  (2),  3>  and  L are  linear  operators 
of  the  form 


o 


0e  0],  R"),  where  ij,  n are  n x n matrix  functions  of  bounded 

variation  on  [ - h,  0]  and  n(0)  = 0 with  n continuous  at  0.  Let  0(0)  - &((p)  = 
0(0)  and  let  0O(0)  = A_l(p(-h),  0e  W{2l)([-h,  0],  Rn).  Define  the  difference 
operator  by  C*0((j))  = 0(0)  - g0((j>). 

Theorem  t.  Let  t > h and  let  p be  the  integer  such  that  ph  < x < 
(p+l)/i.  If  rank  Cp  = n,  then  the  solution  operator  uh->xt(\  0,  u), 
u e L2([t0,  ti],  R")  corresponding  to 

d ®0x,  = L(x,)  + Bu(t) 
dt 


(3) 
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has  closed  range  and  finite  deficiency  in  W'<21,([  — h,  0],  R").  Moreover,  if 
g — g0  is  sufficiently  small  or  if  g = g0  + fib,  where  0,(0)  = * d/.(0)<p{0) 

and  /.  is  aniixn  matrix  function  whose  entries  are  in  W/22,([  — h,  0],  R),  then 
the  solution  operator  ut—»xT(\  0,  u),  ue  L2([0,  r],  Rn)  corresponding  to  (2) 
has  closed  range  and  finite  deficiency  in  — 0],  Rn). 

An  easy  corollary  to  Theorem  1 is  that  the  solution  operator  having 
closed  range  and  finite  deficiency  is  a generic  property  of  systems  of  the 
form  (2). 

Returning  now  to  the  controllability  problem  for  system  (1)  we  note  that 
in  [1]  it  was  shown  that  [for  system  (1)]  is  a nondecreasing  set-valued 
function  of  r that  is  constant  after  nh  In  view  of  this  we  confine  our 
attention  to  the  situations  t > nh  and  rank  C„  = n.  Using  operational 
techniques  developed  in  [I],  we  can  obtain  explicit  criteria  for  controllability 
of  (1)  under  these  circumstances.  The  nature  of  the  criteria  is  effectively 
demonstrated  in  the  case  m = 1,  that  is,  when  B is  a single  column  and 
u a scalar  function. 

In  applying  operational  techniques  it  is  convenient  to  introduce  certain 
function  spaces: 

X0  = {x:  (-oo,  t]  - R"|x(f)  = 0 for  t < 0 and  x|(0.t)  e W^'([0,  t],  R% 
U0  = { u : ( — oo,  r]  -*•  R\u(t)  = 0 for  t < 0 and  u|(0,  t)  e L2([0,  t],  R)}. 

For  f e X 0 or  f e U0  the  shift  operator  S is  defined  by  (Sf)(t)  =f(t-  h), 
t < t.  For  f e X0  the  derivative  operator  is  denoted  by  D,  so  that  ( Df)(t ) = 
/(f),  f < t (a.e.).  We  identify  S°  and  D°  with  the  identity  operator. 

The  extension  of  x(-,  0,  u)  to  a function  x e X0  satisfies 

(InD  - A_^SD  - A0  - AxS)x  = Bu,  (10) 

if  u here  denotes  the  extension  to  U0  of  the  control  u defined  on  [0,  r]. 
Treating  D and  S formally  as  scalars,  we  define 

n - 1 

P(D,  S ) = adj(/„D  - A.XSD  - A0  - AtS)  = £ Pj(D)Sj. 

J-o 

Here  adj  denotes  the  transposed  matrix  of  cofactors  so  the  Pj(D)  are  n x n 
matric  polynomials  in  D.  With  these  we  define  the  operator  matrix 

K(D)  = [ P0(D)B , Pl(D)B, P „-i(D)B], 

the  submatrices  Pj(D)B  being  formal  products  rather  than  the  results  of  the 
operators  Pj(D)  acting  on  the  constant  function  B. 

Using  some  ideas  from  [5],  we  have  proven  the  following  in  [1]: 


< \ 
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G_  j being  the  companion  matrix  for  the  characteristic  polynomial  of  A _y. 
For  any  matrix  M with  n rows,  let  \i  denote  the  submatrix  consisting  of 
the  first  n — 1 rows.  For  the  canonical  form  (6)  of  Eq.  ( 1 ) the  corresponding 
matrix  K(/.)  is  determined  by  G0  and  Gv  In  fact,  one  can  easily  show 
that  the  necessary  condition  in  Theorem  3 is  equivalent  to 

rank[7„/.  — G_l/.e~'h  — G0  — G1e,~/J']  = n — 1 

for  all  complex  /..  One  may  readily  apply  the  results  of  Theorems  2 and  3 
to  specific  cases.  The  following  examples  illustrate  the  methods. 


Example  1. 

Let  G,  - 

0 and  G0 

= [diag  (a „ .. 

-.On-l 

).  0],  If 

r > nh.  then 

(6)  is  controllable  on  [0. 

t]  iff  none 

of  the  a , , i = 

1 

n - 1, 

is  zero. 

Example  2. 

(n  = 3). 

Let 

0 0 

0 

0 

0 

b2 

<50  = 

and  Gj  = 

= 

- 1 a 

0 

0 

4 

0 

If  t > 3 h,  then  ( 

6)  is  controllable  on  [0.  t]  iff  h ^ 

0 and 

a * ± 

b - 3e*bh. 

Example  3. 

(n  = 2). 

Let  G0  = 

bo /a  - 

. Ift  > 

2k,  then  (6) 

is  controllable 

on  [0, 

t]  iff  [f0 

= 0,  /f,  -I-  x0 

+ 7.^ 

' * * o, 

or  Po  ^ 

Po  + + Pi)e 

/h  ^ 0 for  /.  such  that  (/.  - x0)(/. 

+ P t)  + Po  al 

= 0. 
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Lemma  1.  If  r > nh  and  B is  n x 1,  then  for  every  ip  e ’([  — h,  0],  Rn) 
there  exists  u e U0  such  that  the  solution  x e X0  of  (10)  satisfies  xt  = ip  iff 
there  exists  oj  e ^"’([r  - h , r],  Rn ) such  that 

K(Dy»(t)  = i p(t  - t),  f e [t  - h,  t],  (4a) 

D‘c)j(x  - h)  = i = 0, n 1 , j=l,...,n-  1,  (4b) 

where  o>,  denotes  the  jlh  component  of  oj. 

Using  this,  we  are  able  to  prove 


Theorem  2.  If  t > nh,  B is  n x 1,  and  rank  C„  = n,  then  system  (1)  is 
controllable  on  [0,  t]  iff  the  problem  (4a)  and  (4b)  has  a solution  for  every 
\\j  e W'2  ’([  — h,  0],  R"),  moreover,  this  holds  iff  the  homogeneous  problem, 
(4a)  with  ip  = 0 and  (4b),  has  only  the  trivial  solution. 

Under  the  assumption  that  rank  Cn  = n it  is  the  case  that  A(D)  - det  K(D) 
has  degree  n{n  - 1 ) in  D The  solutions  of  K(D)o)  = 0 thus  are  of  the  form 

nln-  1 ) 

oj(t)  = £ 'Ai(r) 

i=  i 

for  some  y‘  e R",  i = 1,  ....  n(n  — 1),  if  the  v(  form  a basis  for  the  scalar 
solutions  of  A(D)i  = 0.  Using  a basis  consisting  of  functions  of  the  form 
tV',  we  obtain 


Theorem  3,  If  r > nh,  B is  n x 1,  and  rank  C„  = n,  then  ( 1 ) is  controll- 
able on  [0,  r]  implies  that  for  all  complex  /.  either  A(/.)  ^ 0 or 


w(;.)  = K(X) 


. - /.h 


. {n  - 1 )/.h 


*0. 


(5) 


When  B is  n x 1 and  rank  C„  = n,  there  exists  a nonsingular  n x n constant 
matrix  T such  that  with  y = Tx  the  controllability  of  (1)  is  equivalent  to 
the  controllability  of 

(lnD-G.,SD -G0-GxS)y=  B0u,  (6) 
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Local  Controllability  of  a Hyperbolic  Partial 
Differential  Equation 
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Introduction 

Let  D be  a parallelepipedon  in  RN  with  boundary  B.  By  Hr(X),  we  mean 
the  (possibly  fractional)  Sobolev  space  of  order  r on  X.  We  let  T exceed  the 
diameter  of  D in  what  follows.  The  chief  result  we  report  is 


Theorem  1.  There  is  an  open  ball  V about  0 e H = H2(D)  x //‘(D) 
such  that  for  (gt,  g2)e  V there  exists  a unique  control  he  H3  2{B  x [0,  T ]) 
for  which  the  equation 


u„  = A u +J'(u , u,); 


= h: 


«*  lo,  ri 


w(x,  0)  = 0,  m,(x,  0)  = 0, 


0) 


has  a unique  solution  in  C([0,  T];  H)  and  u(x,  T ) = gt,  ur(x,  T)  = y2.  The 
nonlinear  function  / and  its  Frechet  derivative  are  each  assumed  to  be 
Lipschitz  continuous  at  0 e H. 


For /=  0,  Russell  [2]  has  established  Theorem  1 for  any  (gx,  g2)  e H and 
any  bounded  domain  D with  piecewise  smooth  boundary.  Our  approach  is 
to  use  Russell’s  result  for  the  linear  case  together  with  the  inverse  function 
theorem  to  show  that  the  nonlinear  solution  operator  at  time  T is  a local 
homeomorphism  at  0 e H. 

After  outlining  the  method  of  attack  that  leads  to  Theorem  1,  we  discuss 
some  numerical  work  that  illustrates  it. 
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Development  of  Theory 


It  is  convenient  to  rewrite  (1)  as 


vv  = Aw  + F(w); 


= /i;  w(0)  = 0, 

b *io. n 


(2) 


in  which  w(t)  = (u(<>,  r),  (u,(  , t))e  H,  F = ('}):  H -*  H,  and  A = (£  q)  is  an  un- 
bounded linear  operator  on  H.  and  h = (/j,  h,)  e H 1 2(B  + [0.  T]) 
x Hx  2(B  x [0,  T]). 

In  order  to  prove  that  (2)  has  a solution  in  C([0,  T];  H)  we  split  (2) 
into  two  problems: 


and 


v = Av; 


= h;  r(0)  = 0, 


fl*|0,  T| 


(3) 


z — Az  + F(v  + 2); 


= 0;  zy0)  = 0. 

fl«|0.  T] 


It  is  clear  that  the  solution  w of  (2)  is  obtained  by  w(f)  = v(t)  + 2(f). 
Our  first  step  is  to  establish 


(4) 


Theorem  A.  For  any  u e H,  there  is  a control 

hu  e H3,2(B  x [0,  T])  x Hi/2(B  x [0,  T]), 
such  that  (3)  has  a unique  solution  wu(t ) in  C([0,  7];  H)  with  wJT)  = u. 
One  then  can  use  standard  iterative  techniques  to  prove 


Theorem  B.  If  vu  denotes  the  solution  to  (3)  as  noted  in  Theorem  A, 
and  u is  sufficiently  small,  then 


z = A:  + F(z  + rj; 


= 0;  2(0)  = 0. 


(4') 


l«*io.  n 

has  a unique  solution  zu  e C([0,  T];  H). 

With  these  theorems  established,  we  note  that 

wu(t)  = vu(t)  + 2u(r) 


is  a solution  to  (2)  with  boundary  control  h = hu,  where  hu  is  the  control 
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that  steers  the  linear  problem  from  0 to  u.  Thus  wu(T)  is  not  exactly  m, 
but  “misses”  u due  to  the  presence  of  F. 

We  define  G:  U c H ->  H by  G(u)  = wu(T),  with  U some  open  set  about  0 
for  which  Theorem  B holds.  Note  that  G(u)  = u + zJT).  We  then  prove 

Theorem  C.  G is  continuously  Frechet  differentiable  at  0 e H,  G( 0)  = 0, 
and  G'(0)  is  a linear  homeomorphism. 

Once  Theorem  C is  established,  an  application  of  the  inverse  function 
theorem  finishes  the  proof  of  Theorem  1. 

Algorithm.  Suppose  we  are  given  the  system  (1)  with  desired  terminal 
state  (gu  g2 ) e H.  To  find  the  control  fi(gu  g2)  we  must 

(1)  Find  G-  ’(</,,  g2)  = (J\,f2). 

(2)  Compute  the  control  that  would  steer  (0,  0)  to  (J\,f2)  for  the  linear 
problem. 

In  practice,  we  can  estimate  G~l(u)  by  [G'(0)]''i/  for  small  u in  H. 
[G'(0)j  1 can  be  approximated  by  solving  a linear  integral  equation  on  //. 
Finding  the  controls  for  a linear  problem  is  a relatively  easy  matter  of 
solving  some  related  Cauchy  problems  (see  [1]  for  more  explanation). 

Numerical  results.  As  an  example,  the  following  system  was  solved: 

D = [0,  1]  x [0,  1]  cz  R\  T = 1.96, 

un  = uxx  + Uyy  + 0.1wr,  (g ,,  g2)  = (0.1  sin2  nx  sin2  ny,  0). 

Of  course,  the  term  0.1m,  is  not  a nonlinearity,  but  it  was  treated  as  a non- 
linear perturbation  to  test  the  algorithm.  Using  grid  sizes  of  Ax  = Ay  = 0.1, 
A T = 0.07  and  Ax  = Ay  = 0.05,  AT  = 0.035,  the  problem  was  solved  twice 
and  Richardson  extrapolation  applied  to  the  two  approximate  answers. 
The  resulting  approximate  control  steered  (0,  0)  to  a state  (u,,  u2)  at  time  T, 
and  if  we  define  the  relative  error  of  the  method 

p2  _ llMi  — 9i\\  + llW2  — ^2  111 

lift  lit  + Mi 

where  j|  / ||r  is  the  Sobolev  norm  of  order  r of /,  then  our  program  produced 
E2  = 0.036.  [The  crude  grid  sizes  necessitated  by  limited  computer  storage 
capacity  make  it  difficult  to  work  with  accuracy  in  a norm  so  exacting  as 
H2(D).] 
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Consider  the  control  system 

x(t)  = A(t)x(t)  + B(t)u(t),  (S) 

with  fixed  initial  data  x(f0)  = x0 . 

The  vector  x € R",  A and  B are  n x n-  and  n x m-matrix-valued  functions 
with  n — 2 and  n — 1 continuous  derivatives,  respectively,  and  u e Rm  is  a 
vector-valued  function  with  values  constrained  to  lie  in  the  unit  cube  U of 
Rm.  If  x(f,  u)  denotes  the  absolutely  continuous  solution  of  (S)  satisfying 
x(r0,  u)  = x0,  the  problem  is  to  determine  a control  u*,  subject  to  its 
constraints,  in  such  a way  that  the  solution  x = x(f,  u*)  of  (S)  satisfies  a 
termination  condition  Z(x,  r)  = 0 in  minimum  time  t*  ^ 0.  Such  a control 
will  be  termed  time  optimal. 

Let  X(t,t0)  be  the  principal  matrix  solution  (at  r0)of  the  matrix  differential 
equation 

X = A(t)X. 

Definition.  Each  m-vector-valued  function  y, 

y(t)  = c'X~l(t,  t0)B(t),  csR", 

will  be  called  an  index  of  the  control  system  (S). 

We  have  isolated  and  named  a concept  that  appears  implicitly  in  control 
theory.  The  connection  is  that  under  assumptions  of  normality  each  optimal 
control  u must  have  the  form 

u = sgn  y'  for  some  index  y. 

* Supported  in  part  by  1974  Cleveland  State  University  Research  Initiation  Award. 
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Proposition  t.  If  (S)  is  controllable,  then  its  indices  (or  rather  their 
transposes  y)  form  an  n-dimensional  subspace  of  the  solution  space  of  the 
nth  order  indicia!  equation  in  m-space 

(T) 

k-0 

with  continuous  m-square  matrices  Ak. 

If  (S)  is  normal,  then  each  Ak  can  be  taken  diagonal,  whereupon  the 
indices  determine  (T)  uniquely. 

(T)  will  be  called  the  indicial  equation  of  the  control  system  (S).  For 
obvious  reasons  we  will  call  (T)  uncoupled  if  the  matrices  Ak  are  diagonal. 


Examples  of  indicial  equation.  The  control  system  with  scalar  equa- 
tion 


x 3-  a(t)x  -1-  b(t)x  — u,  — 1 < u < 1, 

(1) 

has  the  indicial  equation 

y - ay  + {b  - a)y  = 0. 

(2) 

Analogously, 

x + ax  + bx  + cx  = u(t) 

(3) 

has  the  indicial  equation 

y - ay  + (b  - 2 d)y  -t-  (b  — c - a)y  = 0. 

(4) 

Incidentally,  (2)  is  the  adjoint  equation  of  (1)  [more  precisely  the  scalar 
equivalent  of  the  adjoint  equation  to  the  first-order  two-dimensional  version 
of  (1)].  (4)  is  the  adjoint  of  (3). 


Definition  1.  A collection  Y of  solutions  y of  (T),  all  defined  on  a 
common  interval  /,  will  be  termed  disconjugate  on  /,  if  every  member  y ^ 0, 
and  the  sum  of  the  number  of  roots  (on  /,  counting  multiplicities)  of  its  m 
coordinates  is  at  most  n — 1. 


Definition  2.  In  the  same  situation  Y is  disconjugate  at  a point  t0e  I 
if  it  is  disconjugate  in  some  neighborhood  of  f0  • Also  Y is  geometrically 
disconjugate  in  / if  it  is  such  relative  to  any  coordinate  basis  for  Rm  (i.e., 
if  MY  = {My:  ye  V}  is  disconjugate  for  any  nonsingular  constant  matrix  M). 
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Our  notion  of  disconjugacy  generalizes  that  employed  in  the  study  of 
scalar  nth  order  equations.  Ours  seems  more  applicable  to  control  theory 
than  the  concepts  and  results  known  to  the  authors  for  m > 1 (Coppel  [1] 
for  n = 2 and  Nehari  [3]  for  n = 1 and  general  m). 

The  set  of  solutions  of  (1)  has  a natural  structure  of  a linear  space  over 
the  reals,  with  dimension  nm.  We  will  be  concerned  with  its  linear  subspaces, 
and  primarily  with  subspaces  of  dimension  n.  Again  the  collection  of  these 
subspaces  has  a natural  structure  of  the  Grassmann  manifold  Mn(Rnm ),  an 
analytic  compact  manifold  of  dimension  n2m(m  — 1 ) (Warner  [4]). 

An  n-dimensional  subspace  L of  the  solution  space  of  (T)  can  be  described 
by  specifying  an  initial  time  t0  and  n constant  matrices  N0,  . ..,  N„_1  of 
type  m x n as  follows:  y e L if  and  only  if  there  exists  c e R"  such  that 

ylk){t0)  = Nkc,  for  k = 0, n - 1. 

With  this  description  we  have  the  following: 

Theorem.  L is  disconjugate  at  t0  if  and  only  if  the  following  holds: 
for  any  integers  r,  > 0 with  £7  = n,  the  n vectors 

ej'Nk,  1 <j<m,  0 < k < Tj  — 1, 
are  linearly  independent  (ej  is  the  /th  basic  unit  vector  in  Rm). 

Theorem.  The  collection  Y of  all  n-dimensional  subspaces  of  the 
solution  space  of  (T)  that  has  each  member  disconjugate  at  t0  is  open  and 
dense  in  the  Grassmann  manifold  Mn(Rnm). 

Non+  1 -dimensional  subspace  is  disconjugate  at  any  point. 

Definition  3.  The  system  (S)  is  strictly  normal  at  t0  if 
rankfP/j^fo):  1 < j < m,  0 < i < ri 1]  = n, 
for  all  choices  of  integers  r,  with  £”=  1 rj  = n-  operator  is 

T = -A(t)  + d/dt 

(with  T,+  l = TP  for  i =0,  1, ...)  [2]. 

Corollary.  The  collection  of  indices  of  (S)  is  disconjugate  at  t0  if  and 
only  if  (S)  is  strictly  normal  at  r0. 

As  a consequence  of  this  it  is  now  possible  to  construct  optimal  feedback 
controls  for  the  system  (S). 


310 


E.  N.  CHUKWU  AND  O.  HAJEK 


REFERENCES 


[1]  W.  A.  Coppel,  “Disconjugacy.”  Springer-Verlag,  Berlin  and  New  York,  1971. 

[2]  H.  Hermes  and  J.  P.  LaSalle,  "Functional  Analysis  and  Time  Optimal  Control.” 
Academic  Press,  New  York,  1969. 

[3]  Z.  Nehari,  Non-oscillation  and  disconjugacy  of  systems  of  linear  differential  equations, 
J.  Math.  Anal.  Appl.  42  (1973),  237  254. 

[4]  F.  W.  Warner,  “Foundations  of  Differential  Manifolds  and  Lie  Groups.”  Scott  Foresman, 
Glenview,  Illinois,  1971. 


Control  for  Linear  Volterra  Systems  without  Convexity 


THOMAS  S.  ANGELL 

Department  of  Mathematics 

University  of  Delaware,  Newark,  Delaware 


We  report  here  results  concerning  the  existence  of  optimal  and  optimal 
bang  bang  controls  for  systems  whose  dynamics  are  described  by  Volterra 
integral  equations  that  are  linear  in  the  state  variables.  Using  techniques 
similar  to  those  developed  by  Cesari  (see,  for  example,  [2,  4])  for  control 
systems  monitored  by  ordinary  and  partial  differential  equations,  we  are 
able  to  establish  a Neustadt-type  existence  theorem  for  usual  solutions  in  the 
absence  of  convexity  assumptions  (see  Neustadt  [6]).  We  also  state  a result 
concerning  the  existence  of  an  optimal  bang  bang  controller  when  the 
control  domain  is  fixed  and  compact.  Similar  techniques  have  been  used  by 
Suryanaryana  to  discuss  these  questions  for  linear  control  problems  with 
total  differential  equations  [7].  Existence  theorems  for  related  (nonlinear) 
systems  are  discussed  in  [1]  and,  in  a more  general  setting,  by  Warga  [8], 
Here,  the  particular  form  of  our  equations  allows  us  to  introduce  the 
notion  of  generalized  solution  in  a simple  manner,  and  to  avoid  convexity 
assumptions.  Unlike  [8],  we  also  consider  the  possibility  of  unbounded 
controls. 

Specifically,  we  consider  the  Volterra-type  integral  equation 

. t 

x(t)  = x(/J  + | [h(t,  .sjx(.s)  + g(t,  s)f(s,  m(.s))]  ds,  ( , < t < t2 , 

where  the  state  variable  x(f)  = (x1,  ...,  x")e  E"  and  the  control  variable 
u{t)  = (u1,  ...,  um)e  Em.  Let  A = [f,*,  /2*]  x Z c £nf  1 and  write  I A = 
[/,*,  /2*].  For  each  t e A,  let  U(t)a  Em  be  closed  and  let  M0  = 
{(f,  m)|/  e / 4,  w e U(t)\.  We  require  that  the  function /be  continuous  on  M0 
and  that  the  kernels  g and  h be  continuous  on  A = {(/,  s) : f,*  < ,v  < t < 
/2*J  and  zero  for  /,*  < t < s < t2*. 

We  consider  admissible  pairs  of  functions  {x,  u[,  each  pair  consisting  of 
a measurable  function  u:  [/,,  f2]  -» Em  and  a corresponding  function 
x : [f  i , f2]  -♦  E",  which  satisfies  the  Volterra  integral  equation 

x(t)  = x(tj)  + I [h{t,  v)x(.v)  + g(t , s)f{s,  u(.s))]  ds, 

(la) 

f,*  < fj  ^ t < t2  < t2 *, 
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subject  to  boundary  conditions 

t](x)  = (/,,  jc(f, ),  t2,x(t2))e  B = x B2,  (lb) 

where  B is  closed  in  E2n  + 2 with  c En+  1 compact,  as  well  as  constraints 

(t,  x(r))e/l  for  all  fe[f,,f2],  (lc) 

u(t)eU(t)  for  almost  all  fe[f,,f2].  (Id) 

We  assume  that  the  class  of  all  admissible  pairs  ft  is  nonempty,  and  consider 
the  Mayer  problem  min4i  I[x , u]  = minue^,  x(t ,),  /2,  x(f2)),  where  e is  a 
real-valued  function  defined  and  continuous  on  B a E 2n+2. 

If  {x,  ujeft  we  refer  to  u as  an  admissible  control  and  to  x as  an 
admissible  trajectory.  The  necessary  closure  and  compactness  properties  of 
the  set  of  admissible  trajectories  can  be  ensured  by  requesting  that  the 
function  / satisfy  a growth  condition  with  respect  to  a “comparison 
function”  H that  is  defined  and  continuous  on  M0.  More  precisely,  the 
function  / is  said  to  satisfy  the  growth  condition  (yH)  if,  for  any  r.  > 0, 
there  exists  a nonnegative  locally  L-integrable  function  t ]/,_  such  that 
| f(t,  u) | < ij/c(t)  + i.H(t , u)  for  all  (/,  u)  e M0 . 

Let  K > 0 be  a fixed  constant  and  denote  by  ftK  the  set  consisting  of  all 
{x,  u\  e ft  such  that  H(t,  u(t ))  is  L-integrable  and  jjj  H(t , u(t))  dt  < K.  Let 
R = \y{t)  = f(t,u(t))\{x,u\e  ft/<}  and  let  R be  the  set  of  extensions  of  elements 
of  R to  all  of  I A by  the  value  zero.  Let 

CM  = jz(f)  = | [h(t,  ,s)x(.sj  + g(t,  s)f(s,  u(.s))]  ds,  {x,  u]  e ft*| 

and  let  C°  be  the  set  of  extensions  of  elements  of  C°  to  all  of  I A by 
continuity  and  constancy.  Then,  as  a special  case  of  the  results  of  [1]  we 
have  the  following 


Theorem.  If  the  function  /satisfies  (yH),  the  set  R is  weakly  relatively 
compact  in  L,"(/ A)  and  C°  is  an  equicontinuous  equibounded  set. 

Associated  with  the  present  problem  is  the  orientor  field  Q„(t)  defined 
by  the  functions  / and  H,  namely, 

Q„(t)  = {(z°,  z)|z°  > H(t,  u),  z =f(t,  u),  u e U(t)} 

Under  the  hypothesis  that  the  set  Qn(t)  is  convex  for  all  t e I A,  one  may 
apply  variants  of  closure  theorems  proved  by  Cesari  [3]  to  prove  the  existence 
of  an  optimal  solution.  The  present  situation  is  particularly  simple  since 


aA 
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the  orientor  fields  do  not  depend  on  the  state  variable  and  no  additional 
seminormality  conditions  [e.g.,  the  upper  semicontinuity  condition  property 
(Q)]  are  needed. 

However,  the  linear  character  of  the  control  system  (la)-(ld)  allows  one 
to  establish  existence  without  assuming  convexity  of  the  QH(t).  We  introduce, 
for  nonconvex  QH(t\  the  concept  of  weak  solution  in  a manner  completely 
analogous  to  the  familiar  case  of  ordinary  differential  control  systems  (see, 
e.g.,  [2]  or  [5]).  That  is,  we  consider  triples  (x(r),  p(t ),  where  x(f)  = 

(x1,  ...,  x"),  p(t)  — ( p1 , ...,  pn+1)  is  measurable,  u*(t)  = (u(1),  ...,  m*"+1)), 
fi  <t  <t2,  and  for  / = 1,  n + 1,  p‘  > 0,  £"=/  p‘  = 1,  u(,)(r)e  U(t).  Let 
I(t,  P'  w*)  — (7i /„),  where 

Jj{U  p,  u*)  = X P%(U  u«°), 
i = 1 

and  let 

H*(t,  p,u*)= £ p‘H(t,  u(i)). 

i = 1 

Then  the  triple  {x,  p,  u*}  is  called  a generalized  solution  to  (la)-(ld) 
provided 

t 

x(f)  = x(t1)+  | [ h(t , s)x(s)  + g(t , s)/(s,  p(s),  u*(s))]  ds,  tl  < t < t2.  (2) 

Let  Q*  denote  the  class  of  all  generalized  solutions  such  that  t](x)  e B 
with  H*(t , p(f),  u*(r))  L-integrable  in  [tl5  t2]  and 

j H*(t,  p(t),  u*(t))  dt  < M. 

•i  i 

We  write  J[x , p,  n*]  = e(rj(x))  and  ask  for  the  absolute  minimum  j of  J in 
Q*.  Since  Q c=  Q*,j  < i = infn  I[x,  u].  Note  that,  by  Caratheodory’s  theorem, 
the  orientor  field  of  this  new  problem, 

Qh( 0 = {(A  z)|z°  > H*(t,  p,  u *),  z =f(t,  p,  w*)}, 

is  just  co  Qh{ t),  the  convex  hull  of  the  original  orientor  field.  It  is  easy 
to  check  that  the  assumption  of  the  growth  condition  (yH)  for  / implies 
that  7 satisfies  the  analogous  growth  condition  The  central  result  is 
the  following: 

Theorem  1.  Let  (x(t),  p(t),  u*(t)}  be  a generalized  solution  in  the  sense 
described  above.  Then  there  exists  a usual  solution  {x,  5}  e ClK  such  that 
>l(x)  = n(x). 
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We  indicate  a proof  of  this  theorem  as  follows.  If  r(t,  s)  is  the  resolvent 
kernel  associated  with  the  kernel  h(t , s),  we  may  represent  the  solution  of 
(2)  corresponding  to  u*  as 

*(0  = M'i)  + GK](0]  - | r(t,  s)G[u*](s)  ds, 

where 

G[u*](r)  = | g(t , s)/(s,  u*(s))  ds. 

• i, 

Then, 

x(t2)  = x(tx ) + X | g(t2 , s)pl(s)f(s , u,,,(-v))  ds 

1=1  -r, 

- | p‘(s)R(t2 , s)/(s,  w<0(.s))  Js 
• »> 

By  a Lyapunov-type  theorem  on  vector  integrals  (proved  by  Cesari  [4]), 
there  exists  a decomposition  of  [fl5  t2]  into  n + 1 disjoint  measurable  sets 
£l5  . . En+ 1 such  that 

*('2)  = x(h)  + Z I [g(t2<s)+ R{h,s)]f(S’uil)(s))ds. 

I = 1 • £, 

The  ordinary  control  function  u defined  by  u(t)  = u‘(t),  t e l = 1, 
n + 1,  generates  a trajectory  x with  the  same  ends  as  the  (generalized) 
trajectory  x.  This  result  leads  immediately  to  the  following  Neustadt-type 
existence  theorem. 

Theorem  2.  Let  the  sets  A,  B,  and  M0  be  as  described  above  and 
suppose: 

(i)  The  functions / = (/j,  ...,/„)  and  H are  continuous  on  M0  and  / 
satisfies  (yH); 

(ii)  The  sets  QH(t)  are  closed; 

(iii)  The  functions  h and  g are  continuous  on  A. 

Let  Q be  a nonempty  closed  class  of  admissible  pairs  such  that  H(t,  u(t))  is 
£-integrable  and  Jjj  H(t , u(t))  dt  < M x for  some  M,  > 0.  Then  the  cost  func- 
tional /(x,  u)  = e(rj(x))  has  an  absolute  minimum  in  ft. 

To  establish  this  theorem,  one  replaces  Qn(t)  with  Qh(i)  = c°  Qh(1)  as 
described  above,  and  considers  the  corresponding  optimization  problem  in 
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the  class  of  generalized  trajectories  Q*  with  cost  functional  J[x,  p,  u*].  We 
may  then  apply  the  existence  theorem  of  [1]  to  obtain  a generalized  optimal 
solution  {x0,  p0,  u0*[  e Q*.  Theorem  1 now  guarantees  the  existence  of  a 
usual  solution  {x,  i/}  e fl  that  is  optimal. 

These  techniques,  together  with  the  McShane-Warfield  lemma,  also  afford 
a result  on  the  existence  of  bang-bang  controllers  for  problems  in  which 
the  control  domain  is  a fixed  compact  set.  Specifically,  we  can  establish 
the  following: 

Theorem  3.  Let  A = [r0,  T]  x Z,  Z c and  let  U c=  Em  be  a fixed 
compact  set.  Let  g and  h be  continuous  for  t0  < s < t < T and  zero  for 
tQ  < t < s < T,  and  let  the  system  be  monitored  by 

. I 

*(/)  = x(r,)  + | [h(t,  s)x(s)  + g(t,  s)u(s)]  ds,  tx<t  <t2.  (3) 

If  {x,  u}  is  an  admissible  pair,  i.e.,  the  pair  {x,  u)  satisfies  (3)  as  well  as 
u(t)s  U,  te  [fl5  f2]  almost  everywhere,  and  (t,  x(t))  e A,  then  there  exists 
another  admissible  pair  {x,  ii}  defined  on  [f,,  t2],  such  that  xftjj^xftj), 
x(t2 ) = x(t2 ),  u(t)  edU,  te  [r„  t2]. 
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1.  Introduction 

In  this  chapter  we  shall  prove  that  the  system 

x(r)  = Ax(t)  + £ bt  c,Tx(f  - ih),  t > 0,  ( 1 ) 

i=  i 


has  the  property  that  if 


cfTx(/1)  = 0,  f]  > 0, 

for  all  solutions  corresponding  to  different  initial  functions  belonging  to 
C[  — mh,  0],  then  d T = 0.  This  property  is  called  pointwise  completeness  or 
pointwise  nondegeneracy.  The  physical  interpretation  of  this  result  is  that 
for  the  control  system 


x(t)  = Ax(t)  + Bu(t), 

where  A and  B are  n x n matrices  and  u(t)'  = u2(t).  u3(r) um(f)), 

there  is  no  state  feedback  control  of  the  form 

u,(f)  = c,Tx(r  - ih),  i = 1,  2,  3 m, 

that  ensures  the  vanishing  of  the  output  drx(t1).  The  case  with  = h2  = 
h3=-=hm  = b has  recently  been  proved  by  Asner  and  Halanay  [1], 
generalizing  the  method  of  Popov  [4],  who  proved  that  the  system 

x(t)  = Ax(t)  + bcJx(t  - h) 


is  pointwise  complete. 

Preceding  page  blank 
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2.  Preliminaries 


Definition  1.  The  delay  differential  system 

x(f)  = Ax(t)  + Bxx(t  - h)  + B2x(t  - 2h)  + •••  + Bmx(t  - mh ) (2) 

is  called  pointwise  degenerate  at  tx  with  respect  to  the  n-vector  d 7 # 0, 
if  for  all  initial  functions  (p  e C[-mh , 0],  the  corresponding  solution  satisfies 
the  relation 

d]x(t\  (p)~  0,  t > tx  > 0. 

If  the  system  is  not  pointwise  degenerate,  then  it  is  called  pointwise  complete. 
The  largest  set  of  real  numbers  at  which  the  system  is  pointwise  degenerate 
with  respect  to  d is  called  the  degeneracy  set  for  d. 

We  introduce  the  following  notation: 

Pi(s)  = (si  - A)  lB{  A(.v)  = G,(s)  A(.v),  i = 1,  2,  3, . . . , m, 

Ms)  = det  (si  -A), 

q0(s)  = /,  the  n x n identity  matrix, 

<li(s)  = qo(s)pAs),  ^ 

(h(s)  = (h(s)Pi(s)  + q0(s)p2(s)  A(.s), 

<h(s)  = q2(s)Pi(s)  + qi(s)p2(s)  A (s)  + q0(s)p3(s)(Ms))2, 

and  in  general, 

qk(s ) = qk-i(s)Pi(s)  + qk- 2(s)p 2(s)  A(.s)  + qk-  3(s)p3(s)(Ms))2  + • • • 

+ qk-Js)pm(s)(A(s)r  \ k = 1,  2,  3,  4 r,  (4) 

where  r is  a positive  integer.  qk(s)  = 0,  whenever  k is  a negative  integer,  and 


Pki(s)  = qk-i(s)Pt(s)  + qk  - 2(s)Pi+  i A(.v)  + ••• 

+ qk-m(s)Pi  + m-i(s)(Ms))m~  ',  i = 1,  2.  3 m,  k = 1,  2,  3 

(5) 


We  observe  that 


Pm(s)  = qk-i(s)Pi(s)  + qk-2(s)p2(s)  Ms)  + •"  + qk- m(s)pm(s)(Ms))m  1 
= qk(s)- 
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3.  Representation  for  Solution  of  Eq.  (2)  for  a Special  Class 
of  Initial  Functions 

Consider  the  initial  function  <y(  )e  C[  — mh,  0]  given  by 
y(t)  = (MD))k4>Ul  D = d/dt , 

where  A:  is  a positive  integer  >n,  and  0(  ) is  a function  belonging  to 
C"*[  — mh.  0]  with  the  property 

D'4>(  - ih ) = 0,  i — 0,  1,  2,  3 m;  r = 0.  1,  2,  3.  . . . k. 

Then  the  representation  for  the  solution  of  Eq.  (2)  in  the  first  interval  for  the 
initial  function  </(  •)  defined  above  is  given  by 

i = 1 

[pu(D)  as  defined  in  (5)].  This  follows  from  the  fact  that  Xo«i  Pu(D) 
x (A(D))*“  '4>(t  - ih)  satisfies  the  equation 

(DI-A)x(t)=  f B,(A(D))V(f  - ih) 

i = i 

[using  Eq.  (3)]  in  the  first  interval,  and,  due  to  the  fact  that  Dr(p(-ih)  = 0. 

i =0,  1,  2 m,  r = 0,  1,  2,  3,  nk,  the  continuity  condition  of  the 

solution  at  the  origin  is  satisfied.  Similarly,  the  representation  for  the 
solution  of  Eq.  (2)  in  the  second  interval  is  given  by 

*(';  y)  = />i(  f.Pu(D)(MD))k~2<j)(t  - (i  + i)/i)| 

+ tpu(D)(MD))klMt-ih) 

1 = 2 

= f P2i(D)(MD)r  2<t>0  ~ (i  + I )/»),  h < t < 2/1,  (6) 

■ = i 

where  p2i(D)<  ' = 1.  2,  3,  ...,  m,  are  defined  in  Eq.  (5),  and  in  general 
the  representation  for  the  solution  of  Eq.  (2)  in  the  rth  interval  is  given  by 

*(':</)=  jr  pr,(D)(A(D))*~r0(f  — (i  + (r  — \))h),  (r  - l)h  < t < rh.  (7) 

i - 1 


T 
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4.  Necessary  Conditions  of  Pointwise  Degeneracy 

The  necessary  condition  that  the  system  (2)  is  pointwise  degenerate  with 
respect  to  a nonzero  n-vector  d at  the  point  f,,  (r  — 1 )h  <t , < rh,  is  given 
by 

d'pri(D ) = 0,  / = 1,  2,  3,  m, 

where  pri(D)  is  defined  in  Eq.  (5). 

Proof.  We  observe  that  pri(D ) is  an  n x n matrix  all  of  whose  elements 
are  polynomials  in  D of  degree  less  than  or  equal  to  (rn  - 1).  Hence, 
from  Eq.  (7)  d’x(t;  y)  in  the  rth  interval  for  the  above  special  class  of 
initial  functions  can  be  expressed  in  the  form 

* 

dJx(f,  y)  = X I S]riDjd>(t  - (i  + (r  - 1))/!),  (r  - \)h  < t < rh,  (8) 

i = 1 j = o 

where  Sjri  is  an  /i-dimensional  column  vector  and  Sjri  denotes  its  transpose. 
Let  us  choose  the  initial  function  such  that 

D>4>(t0  - (/  + (r  - 1 ))h)  = Sjri, 

7 = 0,  1,2 kn  - 1,  i = 1,  2,  m,  tl<t0<rh,  f,  > (r  - l)h. 

Hence  from  Eq.  (8),  it  follows  that 

dJx(t0\y)=  £ £ (Sjri)(SJrl), 

i = 1 7 = 0 

which  contradicts  the  fact  the  system  (2)  is  pointwise  degenerate  at  t ,.  Hence 
( = 0,7  = 0,  I,  2 nk  - 1,  i = 1,  2 m,  which  proves  the  result. 

I 

P 

5.  Consequences  of  B,  = /),c,T  on  the  Necessary  Conditions  of 
Pointwise  Degeneracy 

. Theorem  5.1.  Suppose  that  i = 1,  2,  3,  ....  m\  then 

d'pr,(D)  = 0,  r > 2,  i = 1,  2,  3 m,  implies  that  d'p2i(D ) = 0,  / = 1,  2, 

3 m,  where  pJD),  p2i{D ) are  defined  in  Eq.  (5). 
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Proof.  Using  Eq.  (5)  we  observe  that  dl  pri(D)  = 0,  i = 1,  2 m,  can  be 

expressed  as  a set  of  m equations  as  follows: 

dTqr.l(D)p(D)blclJMD)  + ■■■  + dlqr_m(D)p(D)bmcm'(MD)r  = 0.  (9) 

^,^-1(0»2^t(A(D))2  + - 

+ d'qr_m+l(D)p(D)bmcm'(A(D)r  = 0.  (10) 

A,-i(O)p(O)hM-1c:_1(A(D)r-,  + dT</r_2(D)p(D)bwc(?I,(A(D)r=0.  (11) 

d\r-l(D)p(D)bmcm'(MD))m  = 0.  (12) 
where  p(D)  = (DI  - A)~  *,  A(D ) = det  (DI  - A). 


Case  1.  Let  c,,  c2,  cm  be  linearly  independent.  Then  it  follows  from 
Eqs.  (9)-(  12)  that 


^r-i(O)p(D)h,=0, 

dTqr_2(D)p(D)bt=0, 

Ar-,m-i,P)p(%  = 0, 
d'qr_JD)p(D)bi  = 0, 


/ = m,  (m  — 1),  (m  — 2) 3,  2,  1, 

i = m,(m  - 1 ) 3.  2, 

i = m,  (m  — 1 ), 
i = m. 


(13) 


as  d'qr^j(D)p(D)bi,  j = 1,  2,  3 m,  i = m,  (m  - 1 ),...,  3,  2,  1,  are  scalar 

quantities.  Let  us  define  the  following  variables: 

cu  = ci'p(D)bj(A(D))i-\  i = 1,  2,  3 m,  j = 1,  2,  3 m 

(14) 

>j  = dfqr-2-J(D)p(D)bi(A(D))‘-  \ / = 1,  2,  3 m,  j = 0,  1,  2,  3 m 

(15) 

We  observe  from  Eqs.  (4)  and  ( 1 3 )- ( 1 5 ) that 

>0  = (y.'c,,  + y,2c2l  + •■•  + yrcm,mD)t  \ i=l2. m,  (16) 


and 


>’o‘cn  + y02c2l  + •••  + y0wcmi  = 0,  i = m,m-  1 3,2,1,  (17) 

>’i ‘cu  + y^cn  + •••  + yimcmi , = 0,  i = m,  m — 1 3,2,  (18) 


y'm-2  + )m  - 2 C2i  + * ‘ ’ + >Z- 2 Cmi  = °.  I = ttl,  W - 1, 

Vm-  lClm  4"  >m-  1*  2m  + ’ ’ ' + >*-  j C'mm  = 0. 


(19) 

(20) 


If  rank(c(y)  = n,  it  follows  from  Eq.  (17)  that  >0'  = 0,  / = 1,  2,  3,  ...,  m, 
showing  that  Eq.  (13)  is  satisfied  if  r is  replaced  by  (r  - 1).  Hence 
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t fpri(D ) = 0 will  imply  that  </r/?(r _ 1)f(D)  = 0, / = 1,2, 3, m,  and  the  process 
can  be  continued  until  dTp2i(D)  = 0,  i = 1,  2,  3,  m.  If  rank  (cy)  is 

less  than  n , then  using  Eqs.  (16)-(20)  the  above  theorem  can  be  proved. 

Case  2.  If  c„  c2 , . . . , cm  are  not  all  independent,  then  also  the  above  can 
be  proved,  but  the  proof  involves  a little  more  algebra. 

Theorem  5.2.  If  the  system  (1)  is  pointwise  degenerate  with  respect  to 
the  n-vector  d / 0,  and  if  drp2i(D)  = 0,  i=  1,  2,  3,  ...,  m,  then  the 
degeneracy  set  is  [h,  oo). 

Proof.  drp2i(D)  = 0,  i = 1,  2,  3, ...,  m,  implies  that 

J1(G1(<t)G<(<t)  + G(.+  1(<t))  = 0,  i=l,2,  ...,m,  Bm+I  = 0,  (21) 

where  G,(a)  are  defined  in  Eq.  (3).  Observing  that  <tG,(<t)  = ^G^rr)  + B,, 
G,(< j)  -»0  as  a -+  oo,  and  multiplying  Eq.  (21)  successively  by  powers  of  cr 
and  taking  the  limit  as  a -»  oo,  we  have  the  following  relations: 

d7  (AjBi+ , + PjBj)  = 0,  Pj  = Aj~  lB{  + Pj-  tA,  P0  = P-t=  0,  (22) 

i = 1,  2,  3,  . . . , m,  j = 0,  1,  2,  3,  . . . , m,  . . . , oo. 

Differentiating  Eq.  (1)  successively  and  using  Eq.  (22),  we  see  that 
y(t)  = d'x(t;  y)  satisfies  the  following  equations  for  / > h: 

y>{t)  = d1  xj(t)  = d' Ajx(t)  + d] Pjx{t  — h),  t > h,  j = 1,  2, ...,  oo.  (23) 

Hence  from  Eq.  (23),  it  follows  that  for  j > 2 n,  there  exists  a nonzero 
j- vector  E'  — (clt  e2,  e3,  ...,  ej)  such  that 

ejy*  + ej.  1 + •••  + exy  = 0,  t > h,  j > In.  (24) 

Now  y(t)  satisfies  the  ordinary  differential  equation  (24)  everywhere  for 
t > hand  y(f)  = 0,  for  t > (r  - \)h  [system  (1)  is  degenerate];  hence  it  follows 
that  y(t)  = 0,  for  t > h.  But  this  is  impossible  (see  [2,  4]).  Hence  the 
system  (2)  is  pointwise  complete. 
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A Perturbation  Method  for  the  Solution  of  an  Optimal 
Control  Problem  Involving  Bang-Bang  Control 


MARVIN  I.  FREEDMAN  and  JAMES  L.  KAPLAN 

Department  of  Mathematics 

Boston  University.  Boston.  Massachusetts 


In  the  newly  emerging  field  of  perturbation  analysis  for  optimal  control 
one  typically  looks  at  a system  of  differential  equations  with  data  depending 
on  a small  parameter  e,  which  arises  as  the  Euler-Lagrange  equation  of 
some  underlying  optimal  control  problem.  (See,  for  example,  the  papers  by 
Kelly  [1],  Kokotovic  and  Sannuti  [2],  O’Malley  [3],  and  Hadlock  [4]).  In  the 
regular  case  the  data  depend  on  t:  in  a smooth  way,  while  in  the  singular 
case  the  order  of  the  system  is  reduced  upon  setting  e = 0.  The  objective  of 
such  studies  has  generally  been  twofold.  First,  one  develops  formal  tech- 
niques for  determining  asymptotic  expansions  in  powers  of  £ for  the  state 
variables,  the  adjoint  response  (costate  variables),  and  possibly  for  the 
optimal  control  variable.  (In  the  singular  case,  so-called  boundary  layer 
expansions  in  stretched  variables  are  required  in  addition  to  the  usual  time 
variable.)  Second,  one  demonstrates  that  the  asymptotic  series  found  are 
uniformly  valid  asymptotic  expansions. 

In  [5]  we  treat  such  a problem  for  the  system 

x =/(x,  e)  + cu,  (la) 

X = g(x.  A,  c),  (lb) 

on  the  interval  [0,  7],  together  with  the  boundary  conditions 

x(0,  e)  = a(e),  (1c) 

X(T,e)  = b(e).  (Id) 

In  the  above  • denotes  d/dt,  c is  a small,  positive,  real  parameter, 
x,  A,/,  g,  c g R ",  and  u is  a measurable  function  on  [0,  T]  with  values  in 
[-  1,  1],  We  may  imagine  system  (1)  arising  as  the  result  of  an  application 
of  the  maximal  principal  to  some  underlying  optimal  control  problem  in  the 
variable  x.  The  variable  / may  be  thought  of  as  the  “costate  variable” 
corresponding  to  x.  The  function  u(t,  e)  appearing  in  (la)  will  be  viewed 
as  the  optimal  choice  for  the  underlying  control  problem;  that  is,  it 
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represents  the  choice  of  admissible  controller  that  maximizes  the  appropriate 
Hamiltonian  function.  We  will  suppose  that  u(t,  c)  takes  the  form 

u(f,  c)  = sgn(c  • A(t,  e))  = c • A(t,  e)/| c(t)  ■ A(t,  e)|.  (1c) 

Because  of  space  limitations,  we  will  demonstrate  our  ideas  here  by 
applying  them  to  a specific  optimal  control  problem.  For  the  complete 
analysis,  including  asymptotic  validity  of  the  procedure  illustrated,  see  [5]. 

Consider  the  problem  of  controlling  a particle  whose  trajectory  is 
determined  by 


z(i)  + ez2(r)  = u(t),  |u(r)|  < 1,  re[0,  3], 

starting  at  the  initial  point 

z(0)  = 5 + e,  z(0)  =1+2 £. 

Say,  for  instance,  that  we  wish  to  maximize  the  functional 

c(u)  = z(3,  e)  - z(3,  e). 

In  system  notation,  if  we  let  x(t)  = z(/),  y(r),  z(t)  we  obtain 


X = y. 

(2a) 

y = —ex2  + u. 

MOI  ^ i 

(2b) 

x(0)  = \ + £, 

(3a) 

y(0)  = 1 + 2£. 

(3b) 

The  Hamiltonian  function  corresponding  to  system  (2)  is  given  by 

H(x,  y,  a,  v,  u,  e)  = Ay  + v(-<;x2  + u), 

from  which  the  corresponding  equations  for  the  costate  variables  A and 
v are  determined.  They  are 

/ = 2r;x,  (4a) 

v=  -A.  (4b) 

The  costate  variables  must  satisfy  the  boundary  conditions 

A(3)=-l,  (5a) 

v(3)  = 1,  (5b) 

which  are  independent  of  £.  When  c = 0,  we  may  easily  obtain  an  explicit 
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solution  to  the  problem  (2)~(5),  which  we  will  denote  by  x0(t),  >0(i), 

mo,  m«(0- is  g'ven  by 

} -\t2  -f  t + {,  0 < t < 2, 

|£/2-3i  + f,  2 < I < 3, 

| - f + 1,  0 < i < 2, 

U - 3,  2 < i < 3, 

Ao(0=  -1. 

v0(f)  = 1-  2, 

u0(t)  = sgn  vo(0  = v0(f)/|v0(f)|  = j + J'  2 < f < 3 

[We  recognize  that  this  is  also  the  solution  of  the  problem  of  bringing  to 
rest  at  the  origin  in  minimum  time  a trolley  that  starts  at  (z( 0),  z(0))  = 
(\,  1)  and  moves  along  a horizontal  track  with  negligible  friction.] 

Now,  let  f(e)  denote  the  switch  time  of  u(t,  i ;)  for  the  perturbed 
equations  (5)  (8).  Let 

t = h{x,  e)  = - x)t(c)  + x(x  - 2). 


XO(0  = 
yoU)  = 


This  Lagrange  interpolating  polynomial  will  freeze  the  switch  point  of  the 
control  at  x = 2,  while  leaving  the  initial  and  terminal  times  fixed.  Now  define 
.Y(t,  /;)  = x(h( x,  e),  e),  and  define  Y(x,  e),  A(t,  /:),  N(r,  e),  U(x,  e)  similarly. 
Thus 


U(x,e)  = 


-1, 

+ 1, 


0 < x < 2, 
2 < x < 3. 


Substituting  these  new  variables  into  (5)  (8)  results  in 

dX/dx  = [5(3  - 2x)t(e)  + (2t  - 2)]Y(x,  e), 
dY/dx  = [j(3  - 2t)i (<;)  + (2t  - 2)][-/;A'2(t,  <;)  + U(x,  e)], 
dA/dx  = [5(3  - 2x)f(e)  + (2r  - 2)][2eA’(r,  «)N(r,  e)], 
dN/dx  = Q(3  - 2t)f(/;)  + (2t  - 2)]A(t,  e). 

Observe  that  X (0,  e)  = x(h( 0,  f.))  = x(0,  e)  = ^ + e,  while  T(0,  l)  = y(0,  /;)  = 

1 + 2e.  Thus,  our  original  initial  conditions  are  preserved  in  the  new  variable 
t.  In  addition,  we  have  right  boundary  conditions  of  the  form  A(3)  = — 1, 
N(3)  = 1.  Now  let  us  suppose  that  each  of  the  variables  appearing  in  the 
transformed  system  possesses  an  asymptotic  series  expansion  in  powers  of  e. 
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Thus 

X(x,  t.)  = A'o(r)  + i.Xx(z)  + c2X2(x)  + ••■,  /(/;)  = t0  + /;/,  + t:2l2  + •••, 

with  similar  expressions  for  Y(x,  <:),  A(t,  e),  and  N(r,  e).  Observe  that  we 
must  have  X0(x)  = x0(r),  Y()( r)  = y0( t),  A0(t)  = /0(r),  N0(t)  = v0(r),  and 

to  = 2. 

Next,  let  us  insert  the  assumed  series  expansions  into  (6).  We  then 
differentiate  the  resulting  equations  with  respect  to  e,  and  set  i:  = 0.  This 
yields 


dXx/dx  = V'j(t)  + [^(3  - 2T)f,]V0(T), 

(7a) 

JYJdx  = -X02(x)  + [i(3  - 2x)t X]U 0(x), 

(7b) 

dAx/dx  = 2A'0(t)N0(t), 

(7c) 

dNx/dx  = — A,(r)  - [|(3  - 2t)/,]A0(t). 

(7d) 

The  appropriate  boundary  conditions  for  the  first  two  equations  in 

(7)  are 

given  by 

*i(o)=i,  y,(o)  = 2. 

(8) 

For  the  second 

pair  of  equations,  we  must  have 

A,  (3)  = 0,  N,(3)  = 0. 

(9) 

In  addition,  since  U(x,  k)  = sgn  N(t,  c),  we  know  that  we  must  have 
N(2,  i:)  = 0.  We  may  now  observe  that  system  (7)  may  be  solved  by 
quadrature.  Moreover,  if  we  solve  (7d),  we  can  set  N x( 2)  = 0 to 
determine  the  value  of  / , . Thus  we  can  determine  the  coefficients  in  the 
asymptotic  series  expansion  of  the  switch  time.  Using  this  value,  the  solution 
A",,  Tj,  A,,  Nj,  U,  is  known  completely.  We  may  then  differentiate  (7) 
once  again  with  respect  to  c and  set  i:  = 0,  to  determine  the  next  terms  in 
the  expansions. 

The  actual  value  computed  for  I,  for  the  problem  treated  here  is 
f,  = 76.6 Thus  we  have 

r(c)  = 2 + 76.6s  + 0(r,2), 
and  the  relationship  t = h(x,  r.)  takes  the  form 

t = r + e[38.3r(3  — t)]  + 0(c2). 

The  expansions  (in  the  variable  r)  found  as  indicated  above  will  be 
uniformly  valid  on  0 < t < 3. 
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We  conclude  by  remarking  that  the  technique  we  have  demonstrated  for 
a control  problem  involving  one  switch  is  indicative  of  a more  general 
technique  that  is  applicable  to  a wide  variety  of  practical  control  problems 
involving  multiple  switches.  This  will  necessitate  the  use  of  a higher  degree 
Lagrange  interpolation  polynomial.  Other  bang-bang  control  problems  can 
be  treated  similarly.  In  the  case  of  a singular  bang-bang  control  problem, 
one  expects  boundary  layer  behavior  at  the  switch  points  as  well  as  at  the 
initial  and  terminal  times.  For  the  minimum-time  problem,  it  is  necessary 
to  choose  a Lagrange  interpolation  polynomial  that  “freezes”  the  terminal 
time  as  well  as  the  switch  time.  These  ideas  will  be  explored  in  detail 
in  forthcoming  papers. 
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Introduction 


A sufficiency  theorem  is  established  for  the  problem  of  minimizing  a 
functional  of  the  form 


F(C)  = g(b)  + 


. T2 

I M[t,  L(t , x(t),  u)]  dt 

• T' 


over  a class  of  terminally  and  differentiably  admissible  generalized  arcs 
(generalized  arc  = boundary  parameters  -I-  trajectory  + relaxed  control). 


Description  of  Results 

This  sufficiency  theorem  applies  to  deterministic  as  well  as  relaxed 
control  problems  and  includes  several  known  sufficiency  results  as  special 
cases.  Consequently,  the  proof,  which  is  by  a new  type  of  expansion 
argument, demonstrates  that  useful  sufficiency  results  can  be  derived  without 
highly  sophisticated  mathematics.  Hypotheses  are  made  with  respect  to 
transversality,  the  costate  (or  Euler- Lagrange)  equations,  and  a strengthened 
maximum  principle.  Several  illustrative  examples  are  given  and  reference  is 
made  to  some  of  the  many  places  in  the  literature  where  derivation  of  a 
problem’s  extremals  using  necessary  conditions  (the  maximum  principle) 
is  tantamount  to  verifying  these  sufficiency  hypotheses. 

The  details  of  proof  and  the  application  of  this  fixed-time-domain  result 
to  minimum-time  problems  will  be  given  in  later  work. 
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